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NUCLEIC ACIDS ENCODI NG ZIXC METAL LC PROTEASES 

Background of the Invention 

This invention relates to isolated nucleic acid -molecules 

5 whicr. encode proteins belonging to a 2inc met alloprotease family. 
The zinc metalloproteases have been implicated in a variety of 
diseases and development disorders that involve* enhanced or 
depressed proteolysis of components of the extracellular matrix, 
receptors, or other extracellular molecules. 

10 More particularly, the invention relates to isolated nucleic 

acid molecules encoding proteins belonging to a subfamily of zinc 
metalloproteases referred to as "ADAMTS" , an abbreviation for A 
Disintegrin-like And Metalloprotease domain with ThromboSpcndin type 
I motifs. Proteins in the ADAMTS subfamily all possess a Zn 

15 protease catalytic site consensus sequence (KEXXH+H) , which suggests 
an intact catalytic activity for each of these proteins. The ADAMTS 
proteins also have putative N- terminal signal peptides and lack 
transmembrane domains, which suggests that the proteins in this 
subfamily are secreted. The proteins in the ADAMTS subfamily also 

20 possess at least one thrombospondin type (TSP1) motif, which suggests 
a binding of these proteins to components of the extracellular matrix 
i.ECM) or to cell surface components. 

Members of the ADAMTS subfamily of proteins are ADAMTS- 1, 
ADAMTS - 2 , ADAMTS- 3. and ADAMTS - 4 . ADAMTS -1 protein is selectively 

15 expressed m colon 26 adenocarcinoma cachexigenic sublines. ADAMTS - 1 
rnSMA is induced by the inflammatory cytokine mterleukin- 1 in vitro 
and by intravenous administration of 1 1 popolysaccharide in vivc. 
Thus,, the ADAMTS - 1 protein is believed to play a role m tumor 
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cleavage of native triple -helical procollagen I and procollagen II. 
The ADAMTS-2 ?ro:e:n also has an affinity for collagen XIV. Lack of 
che ADAMTS-2 protein is known to cause derxatosparax;s in cattle, or 
Eh lers -Dan les syndrome type VI I C (EDS-VIIC) in hunans. EDS-VIIC is 
5 characterized clinically by severe skin fragility, and biochemically 
by the presence in skin of procollagen which is incompletely 
processed at the amino terminus. Thus, it is believed that the 
ADAMTS-2 protein plays a role in processing of procollagen tc mature 
collagen, an essential step for correct assembly of collagen into 

10 collagen fibrils. The ADAMTS - 3 protein is similar in sequence to 
ADAMTS-2 and may have similar function. 

The ADAMTS-4 protein catalyzes cleavage of the core protein of 
the extracellular matrix proteoglycan, aggrecan. Aggrecan 
degradation is an important factor in the erosion of articular 

15 cartilage in arthritic disease. Aggrecan fragments have been 

identified in cultures undergoing carcilage matrix degradation and in 
arthritic synovial fluids. Therefore, overexpress ion or activation 
1C of ADAMTS-4 protein may be related to both inflammatory and non- 
inflammatory arthritis. 

2 0 On the basis of the structure, location, and the demonstrated 

proteolytic activity of ADAMTS proteins 1-4, it is expected that 
ether -embers cf the ADAMTS subfamily play a rele in the cleavage of 
proteoglycan cere proteins tnat are found in the extracellular 
matrix, such as, for example, versican, trevican, neuracan, 110-2, 

25 acgrecan, as well as molecules such as collagen. It is also expected 
that other rrembers of the AT AMIS subfamily play a rele in 
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5ubfar.ily cf prcteir.s, the nucleic acids chat encode such proteins, 
and antibodies that are specific for such prcteins . Such molecules 
are useful research tools for studying develc pment of the 
extracellular matrix during embrycgenesis and fetal development, and 
5 for studying disorders or diseases that are characterized by improper 
development of the extracellular matrix or enhanced or reduced 
destruction of the extracellular matrix. Such molecules, 
particularly the nucleic acids and the antibodies, are also useful 
tools for diagnosing such diseases or for monitoring the efficacy of 

10 therapeutic agents that have been developed to treat such diseases. 

Siiinmai'y of the Invention 
The present invention provides novel , isolated, and 
substantially purified proteins having the characteristics of an 

15 AD ANTS protein. The novel prcteins are referred to hereinafter 
individually as "ADAMTS- 5", "ADAMTS - 6 " , "ADAMT3-7", "ADAMTS - 8" , 
"ADAMTS - 9" and "ADAMTS -10", and collectively as "ADAMTS-N". In one 
embodiment, the ADAMTS - 5 protein is a mature mouse protein which 
comprises amino acid 231 through amino acid 930 of the sequence set 

20 forth set forth m SEQ ID NO: 2. In another erriodiir,ent, ADAMTS- 5 is 
a human ADAMTS - 5 protein which comprises amino acid 1 through amino 
acid 516 Df the sequence set forth in SEQ ID NO: 4. In one 
embodiment, mature human ADAMTS - 6 protein comprises amino acid 245 
through amino acid 660 of SF.Q ZD NO : 6. In one embodiment . mature 

25 human ADAMTS - 7 protein comprises amine acid 2 3 3 through amino acid 
557 cf the sequence set forth in SEQ ZD NO: a. In one embodiment, 
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is a human protein which comprises amino acid 236 through ammo acid 
1382 of the sequence set forth m SEQ ID V,0 14. In another 
embodiment , ADAMTS -9 protein is a mouse protein which comprises amine 
acid 1 through amine acid 974 of the sequence set forth in SEQ ID NO : 
5 16. In one embodiment, mature ADAMTS 10 protein is a human protein 
which comprises amino acid 212 through amino acid 1081 of the 
sequence set forth in SEQ ID NO: 18. In another embodiment, ADAMTS - 
IC protein is a mouse protein which comprises amino acid 1 through 
amino acid 54 7 of the sequence set forth in SEQ ID "0 ■ 2 0 

10 The present invention also provides isolated polynucleotides 

which encode an ADAMTS -N protein or a variant thereof, polynucleotide 
sequences complementary to such polynucleotides, vectors containing 
such polynucleotides, and host cells transformed or transfected with 
such vectors. The present invention also relates to antibodies which 

15 are immunospeci f ic for one or more of the ADAMTS -N proteins. The 
present invention also relates to a protein referred to hereinafter 
as ADAMTS -Rl (ADAK-T-S Related protein-1) and the polynucleotides 
which encode such protein. In one embodiment, the ADAMTS -Rl protein 
comprises amino acid 1 through amino acid 525 of the sequence set 

2 0 forth in SEQ . ID NO: 22. 

Brief D escripticr. cf the Dr awings 
Figure 1 shews an amine acid sequence \ SEQ ID MO : 2 ; of a f ul 1 - length 

mouse ADAMTS-:: prctem arc a nucleic ac^d sequence ■. SZi - - l - - 

which encodes such protein. 

25 Figure 2 shows an amino acid sequence iSEQ ID NO : 4 ) of a partial 

human ADAYTS - 5 protein and a nucleic acid sequence > SEQ ID KO : 3) 

which encodes such protein. 
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Figure 4 shows an aT.ir.o acid sequence (SEC ID NO : 3 ) of a full-length 
hurar. ADAMTS-7 protein and a nucleic acid sequence !SEQ ID NO: 7) 
which encodes such protein. 

Figure 5 shews an amino acid sequence (SEQ ID NO 10) of a full- 
5 length mouse ADAMTS-S protein and a nucleic acid sequence (SEQ ID 
HO: 9) which encodes such protein. 

Figure 6 shews an anins acid sequence (SEQ ID NO; 12) of a partial 
human ADAMTS-3 protein and a nucleic acid sequence (SEQ ID NO: 11) 



1C Figure 7 shews an amino acid sequence (SEQ ID NO ■ 14), of a full- 
length human ADAMTS-9 protein and a nucleic acid sequence (SEQ ID NO: 
13) Which encodes such protein. 

Figure 8 shows an anin:> acid sequence (SEQ ID NO. 16) of a partial 
rouse ADAMTS-9 protein and a nucleic acid sequence (SEQ ID NO: 15} 
15 which encodes such amino acid sequence. 

Figure 9 shows an amino acid sequence (S2Q ID NO 18) of a full -length 
human ADAMTS-10 protein and a nucleic acid sequence (SEQ ID NO: 17} 
which encodes such protein. 

Figure 10 show's an amino acid sequence [SEQ ID NO: 20) of a partial 
20 mouse ADAMTS-10 protein and a nucleic acid sequence (SEQ ID NO: 13} 
which encodes such amir.c acid sequence. 

Ficure 11 shews an amine acid sequence 'SEQ ID NC : 22 ; cf a ful 1 
length AD AM 7 S - P 1 prctr-in ar.d a nucleic acid sequence 'SEQ 12: N2 : I 1 • 
which encodes such protein. 
25 Figure 12 depicts the cloning strategy used for isolation of a. mouse 
and human AD AM 7 S - 5 clNAs b. human AD AM T 5 - 6 cDNA ar.d c. human ADAM7S - " 
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dottec lines. DNA scale marker ; m bp) and ammo acid scale m.arker 
are at upper right. Location of the probe used for in s:t^ 
hybridization (ISK) is shown in a. 

Figure 13 shows the predicted amino acid sequences cf a. the mouse 
5 and human ADAMTS-5 proteins (alignment shows mouse sequence above, 
partial human sequence below) b. ADAMTS-6, and c. ADAMTS-7. The 
active- site sequences and proposed Ket-turn are enclosed in boxes. 
Potential furin cleavage site (si are indicated by arrows. 



10 inked glyccsylat ion are cverlined. Cysteine residues within the 
context of an KM?- like ^cysteine switch" are indicated by the solid 
circles. Other cysteine residues are indicated by asterisks. The 
prodomain extends until the furin cleavage site, and the catalytic 
domain extends from the furin cleavage site to the approximate start 

15 of the disintegrin- 1 ike sequence (Lis) . The star: of the spacer 

domain is indicated; the region between the N- terminal TS domain and 
the spacer domain is the cysteine - r ich domain. The single letter 
amino acid code is used. 

Figure 14 shows Northern analysis cf expression cf ADAKTS-5, 6 and 7. 
20 RilA kilobase markers are show, at left of each autoradicgram, and 

1 1 s sue origin i a : nd : cated above each lane. a. Mouse embryo northern 

b'.ots . b. Human mult iple adult tissue northern clots. 

Figure 15 is a schema t i o representor icn cf the d cm a i n stn.r'-jre of 

ADAKTS-R1 protein as compared tc ADAMT3-1 protein. 
2 5 Figure 16 shows an amine acid sequence (SEQ ID NO: 24) cf an 

alternative embed i men t cf a full ler.ath human ;OAl-'.TS-lC protein and a 
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■:SEQ ID 110 : 25; which encodes such prctem. 

Figure IS is a schematic representation of the domain structure of 
human ADAMTS - 5b prctein as compared to human and mouse ADAXTS -9 
protein . 

5 Detailed Description of the Invention 

AD ANTS - N Proteins 

The present invention relates to novel, isolated, substantially 

purified, mammalian proteins belonging to the ADAXTS subfamily of 

metalloproteases . As used herein, the term "substantially purified" 

-^of^-rc m ^ rmfpir rha 4 - is removed from its natural environment, 

— iT - 

isolated or separated, and at least 60% free, preferably 75% free, 
and most preferably 90% free from other components with which it is 
naturally associated. 

The novel mammalian proteins are ADAXTS-5, ADAXTS- 6, ADAMT5-7, 

15 ADAXTS - 8 , ADAMTS -9 and ADAXTS - 10 , collectively ADAXTS -N. In one 
embodiment, the ADAMTS -5 protein is a mature mouse protein which 
comprises amino acid 2S1 through ammo acid 93 0 of the sequence set 
forth in SSQ ID NO: 2. In another embodiment, the AEAXTS-5 protein 
is a human prctein which comprises amino acid 1 through amino acid 

20 518 of the sequence set forth m SSQ ID MO: 4. In one embodiment, 
ADAXTS - 6 protein is a mat -Lire human protein which comprises amine 
acid 245 through amino scic BeO cf SEQ ID i;0 : 6 . In one embodiment, 
tr.e ADAXTS - 7 protein la a nature human protein which comprises amino 
srid 2 13 through amino acid of the sequence set forth in SS'~ ID 

25 :,'0: 6. In one embodiment, the ADAXTS - 6 protein is a mature mouse 
protein which comprises amino acid 229 through amino acid 905 of the 
sequence set forth 1:1 SEQ ID 11 D : 1C. In another embodiment, the 
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forth m SEC. ID NO: 14. Ir. another embodiment, the ADAPTS - 9 protein 
is a mouse protein which comprises amino acid 1 through ammo acid 
S74 of the sequence set forth in SEQ ID NO: 16. In another 
embodiment, the ADAMTS-9 designated ADAMTS-9b is a human protein 
5 which is comprised of 1934 amino acids as set forth in SEQ ID NO 26. 
In one embodiment, the ADAMTS -10 protein is a mature human protein 
which comprises amino acid 212 through amino acid IC81 of the 
sequence set forth in SEQ ID NO: 18. In another embodiment the 
ADAMTS- 10 protein is a mouse protein which comprises amino sriti *l 

10 through amino acid 525 of the sequence set forth in SEQ ID NO: 20. In 
another embodiment, the ADAPTS- 10 protein is a human protein which is 
comprised of 1072 amino acids as set forth in SEQ ID NO 24. 

All of the novel ADAMTS -N proteins starting at the amine 
terminus comprise a signal sequence followed by a putative pro region 

15 which contains a consensus sequence for furin cleavage {except for 
ADAMTS -10), a catalytic domain, a domain of 60-90 residues with 35 to 
45% similarity to snake venom dis mtegr ins , a TS module, a cysteine 
rich domain containing multiple conserved cysteine residues, a spacer 
domain, and one or multiple C terminal TS modules. (See Figure 12.) 

2 0 As determined using the BLAST software from the National Center for 
7i : ct echnc I eg;/ Information, the predicted rr.ature forms of the ADAT-'TS-N 
proteins shew an overall 2 0 - 3 C % s :mi 1 ari ty to each other and to 
ADAYTS - 1 • 4 , although this may re considerably higher or lower for 
individual domains as described below. 

2 5 The ADAMTS -N proteins also encompass variants of the ADAPTS -N 

proteins shown in Figs. 1 - 1 C . A "var : ant" as used herein, refers t o 
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m which cne or nore of the ar.mc ac^ds in the reference sequence is 
deleted or substituted, or cne or r.cre amino acids are inserted into 
the sequence of the reference amino acid sequence. As a result of 
the alterations, the variant protein has an amino acid sequence which 
5 is at least 95% identical to the reference sequence, preferably, at 
least 97% identical, more preferably at least 98% identical, most 
preferably at least 99% identical to the reference sequence. Variant 
sequences which are at least 95% identical have no more than 5 
alterations i.e. ar.v combination of deletions insertions or 

10 substitutions, per 100 amino acids of the reference sequence. 

Percent identity is determined by comparing the amino acid sequence 
of the variant with the reference sequence using MEGALIGN project in 
the DKA STAR program. Sequences are aligned for identity 
calculations using the method of the software basic local alignment 

15 search tool in the ELAST network service (the National Center for 
3iotechnolcgy Information, 3ethesda, MD) which employs the method of 
Altschul, S. ?. , Gish, VI., Killer, W., Xyers, E. w. & Lipr.an, D. J. 
(1990) J. Mol . Biol. 2 15, 403 -410. Identities are calculated by the 
Align program (DNAstar, Inc.) In all cases, internal gaps and amino 

2C acid insertions in the candidate sequence as aligned are net ignored 
when making the identity calculation. 

sub st i tut icr.i? , it i s preferred t h a t the substitutions be censervat ive 
amino acid substitutions, in which the substituted ammo acid has 
25 similar structural or chemical properties with the corresponding 
amino acid in the reference sequence. Ey \ay of example, 



u t i 



t 



W O HI 1 KT4 



PC T I SOO-212.3 



-10- 

or.e acidic residue, e.g. glutamic acid or aspartic acid, with 
another; replacement cf or.e amide - ccr.ramir.g residue, e.g. asparagme 
and glut amine, with another,- replacement of one aromatic, residue, 
e.g. phenylalanine and tyrosine, with another; replacement of one 
5 basic residue, e.g. lysine, arginine and histidine, with another; and 
replacement of one small amino acid, e.g., alanine, serine, 
threonine, methionine, and glycine, with another. 

The alterations are designed not to abolish the 

imr-i-rnre^Ct- i'.Htv of the vqr'qrt nrnt;piTi \*H t h a nt A hnr. i P"* -h^f hind to 

10 the reference protein. Guidance in determining which amino acid 
residues may be substituted, inserted or deleted without abolishing 
immunoreactivity of the variant protein with an antibody specific for 
the respective reference protein are found using computer programs 
well knowmi m the art, for example, r;NASTAR software. 

15 The ADAM T 3 -N proteins also encompass fusion proteins comprising 

an ADAMTS-N protein and a tag, i.e., a second protein or one or more 
amino acids, preferably from about 2 to 65 amino acids, more 
preferably from about 34 to about 62 amino acids, which are added to 
the amino terminus of, the carboxy terminus of, or any point witnm 

2C the amino acid sequence cf an ADAMTS-N protein, or a variant of such 
protein. Typically, such additions ire made tc stabilize the 
resulting fusion protein or to sirpl rfy pun f icac i cr. of an expressed 
recombinant form of the corresponding ADAXTS - :■: protein or variant of 
such protein. Such tags are known m the art. Representative 

2 5 examples cf such tags include sequences which encode a series of 
hi st id i ne residues, the epitope tag FLAG , the Herpes simplex 
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the respective ATJAMTS - N protein are altered by pcsttrar.siaticn 
processes or synthetic nethocs . Examples of such modifications 
include, cut are not limited to, acetylaticn, amidaticn, AD? - 
ribcsylation, ccvalent attachment of flavin, covalent attachment of a 
5 heme moiety, covalent attachment of a nucleotide cr a lipid, cross- 
linking gamma - carboxylat ion , glycosylat ion, hydroxy la t ion, 
lodmation, methyl at ion , myristoylation, oxidation, pegylaticn, 
proteolytic processing, phosphorylation, prenylation, racemization , 
s::"*'a-inn. and rransf er -R!JA mediated additions of amino acids to 

10 proteins such as arginylation and ubiqui t inat ion . 

The ADAMTS-N proteins are immunogenic and, thus, are useful for 
preparing antibodies. Such antibodies are useful for identifying and 
diagnosing disorders which are associated with decreased expression 
or activity or increased expression of an ADAMTS-:! protein. The 

15 ADAMTS-N protein may also be useful for treating such disorder. 

Diseases involving enhanced or depressed proteolysis of the 
core proteins of the extracellular may involve enhanced expression or 
activity or decreased expression cr activity of one or mere ADAMTS-N 
proteins. Thus, ArJAMTS-N proteins may be used to identify drugs, 

20 polypeptides, auto - ant ibodi es , or other natural compounds which bind 
to an AD AT-: 7 S - N prcrein with sufficient affinity tc block or 

a s saved in the presence and the acser.ee cf the putative inhibit cr or 
facilitator using any cf a variety cf protease assays known in the 
25 art. In general, the activity of the ADAK7S-N protein is assayed 
through t he use cf a peptide or protein substrate having a kr.cwn cr 
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side cf :he cleavage sire and with a f lucrescence-quenchmg group on 
the opposite side cf the cleavage site. Upon cleavage by the 
substrate, quenching is eliminated and a detectable signal is 
produced. Alternatively, the substrate is tagged with a colcrimetnc 
5 leaving group that more strongly absorbs upon cleavage. Agents which 
block ADAMTS -N-catalyzed cleavage cf a protein substrate may be 
administered to a subject to block proteolysis of the corresponding 
protein substrate. 
ADA ^T.q-?' Pmfftin 

10 The present invention also relates tc a protein, referred to 

hereinafter as u ADAMTS - Rl " . From its ammo to its carbcxyl terminus, 
ADAMTS -51 comprises a signal peptide sequence, a TS1 module, a 
cysteine -rich domain, a spacer domain, and three 7S1 modules. Thus, 
ADAMTS-Rl has a structure which is related to or similar to an 

15 ADAM7S-N protein, but which lacks a catalytic domain and a 

dis integrm- like domain. In one embodiment, ADAMTS-Rl, protein 
comprises anno acid 1 through amino acid 525 of the amino acid 
sequence, SEQ ID NO; 22, shown in Fig. 11. Such protein has a 3C-4C* 
cverall sequence identity with similar regions of the ADAMTS -N 

20 proteins. The ADAMTS-Rl proteins also encompass variants cf the 
amino acid sequence shown in Fig. 11 and fusion proteins which 
contain the ammo acid sequence shown in Fig. 11 cr a variar.: 
mere of . Cn the basis cf its dcair. organ i rat :cn , it ir. expert ed 
that ADAMTS-RI can bind to extracellular matrix cr cell surface 

25 molecules, including ADAMT3 - K substrates. Thus, it is expected that 
ADAMTS - R 1 can be used as an cell -matrix cr cell -cell adhesion 
molecule cr an AD AM 7 S - M competitive inhibitor. The ADAM7S - Rl 
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expressicn or increased expression of. an ADAM7S-R1 protein. 
Polynuc ieot ides 

The present invention also provides isolated polynucleotides 
which encode the T.arar.alian ADAMTS-N proteins and the mamma liar. 
5 ADAMTS -Rl protein. Figure 1 shows one embodiment of a 

polynucleotide, SEQ ID NO: 1, which encodes the full-length mouse 
ADAMTS -5 protein. Figure 2 shows one embodiment of a polynucleotide 
SEQ ID NO: 3, which encodes a partial human ADAKTS-5 protein. Figur 
3 shews one embodiment of a polynucleotide; SEQ ID KG: 5, which 

1C encodes a full-length human ADAMTS -6 protein. Figure 4 shews one 
embodiment of a polynucleotide; SEQ ID NO: 7, which encodes a full- 
length human AD AM T 3 - 7 protein. Figure 5 shows one embodiment of a 
polynucleotide; SEQ ID HO: 9, which encodes a full-length mouse 
ADAMT3-8 protein. Figure 6 shews one embodiment of a polynucleotide 

15 SEQ ID NO: 11, which encodes a partial human ADAT-1TS - 8 protein. 
Figure 7 shows one embodiment of a polynucleotide; SEQ ID NO: 13, 
which encodes a full-length human ADAMTS - 9 protein. Figure 3 shows 
cne embodiment of a polynucleotide; SEQ ID NO: 15, which encodes a 
partial ADAMTS - 9 protein. Figure 9 shows one embodiment of a 

20 polynucleotide; SEQ ID NO: 17, which encodes a full-length human 
ADAMTS -10 protein. Figure 1C shews cne embodiment of a 
polynucleotide; SEQ ID I.'C : 19, which encodes a partial mouse ADAMTS - 
: : D rc:« :n . Figure '. ~. shows cne embodiment of a pclynuc 1 ecr ide .- DrQ 
ID KG: 21, which encodes a toll - length ADAMTS -Rl protein. 

25 Due to the known degeneracy of the genetic cede wherein more 

than cne coder, can encode the same amine acid, a DMA sequence may 
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in SEQ ID ::0:t. Similarly a DN'A sequence -ay vary from that shewn i 
SEC ID NC3 : 7, S, H, ana 13, and still encode the amine acid 
sequences shown in SEQ ID NOS : 8, 10, 12, and 14, respectively. 
Such variant DNA sequence may result from silent mutations, such as 
5 for example those that occur during PGR amplification or from 
deliberate mutagenesis of a native sequence. 

The present polynucleotides also encompass polynucleotides 
having sequences that are capable cf hybridizing to the nucleotide 
secvence? o p Ftg^ i - l i nrHp'- qrrincpnt- rr,jirii*:nns, preferably 

10 highly stringent conditions. Hybridization conditions are based on 
the melting temperature" 4 ' of the nucleic acid binding complex or 
prcbe, as described in Berger and Kimmel (1987) Guide to Molecular 
Cloning Techniques, Methods m Enzyr.clogy, vol 152, Academic Press. 
The term "stringent conditions, as used herein, is the "stringency" 

15 which occurs within a range from about Tm-5 (5 C below the melting 
temperature cf the probe) to about 2 0 c C below Tm. As used herein 
"highly stringent" conditions employ at least 0.2 x SSC buffer and a 
least 65° C. As recognized m the art, stringency conditions can be 
attained by varying a number of factors such as the length and 

2C nature, i.e., DNA or RNA, of the probe; the length and nature of the 
target sequence , the concent rat icn cf the salt? and other components 
such as fcrmamide, dextran sulfate , and polyethylene glyccl . of the 
hvbr :d: cat ::.r. solution. A 1 '. of these factors may be varied to 
venerate conditions of stringency which are equivalent to the 

25 conditions listed above. 

The oresent colvnuclett ides also encompasses alleles cf the 
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ADAMTS-R1 encoding sequence. Such mutations topically arise from 
natural addition, deletion of substitution of nucleotides in the open 
reading frame sequences. Any gene which encodes an ADAXTS-N prctem 
cr ADAMTS-RZ protein nay have none, one, or several allelic f orr.s . 
5 Such alleles are identified using conventional techniques, such as 
for example screening libraries with probes having sequences 
identical to or complementary with one or more ADAPTS -N 
polynucleotides . 

The present polynucleotides also encompass altered 

1C polynucleotides which encode ADAKTS-K proteins, ADAXTS -H 1 proteins, 
and variants thereof. Such alterations include deletions, additions, 
cr substitutions. Such alterations may produce a silent change and 
result in an ADA.MTS-X protein having the same amino acid sequence as 
the ADAMTS-N protein encoded by the unaltered polynucleotide. Such 

15 alterations may produce a nucleotide sequence possessing ncn- 
naturally occurring ccdons . For exam.ple, codons preferred by a 
particular prckaryotic or eucaryotic host may be incorporated into 
the nucleotide sequences showing Figures 1 -21 to increase the rate 
of expression of the proteins encoded by such sequences. Such 

20 alterations may also introduce new restriction sites into the 

sequence cr result in the production c c an ADAMTS-N 1 or ADAPTS - ? I 
variant . Typically, such alterations are acccm.pl i shed using rite- 
directed mutagenesis. 

The polynucleotides are useful for producing AZAXTS-N' or 

25 A2AKTS-R1 proteins. For example, an RNA molecule encoding an ATAXTS - 
h" nrctein is used in a cell-free translation systeirs to prepare such 
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SV4 0 , bacterial plasmids, phage DNAs ; yeast: plasmids, vectors derived 
from ccrii nations cf plasmids ar.d phage DNAs , viral DNA such as 
vaccinia, adenovirus, fowl pox virus, pseudorabies , bacuicvirus, ar.d 
retrovirus. The D^JA sequence is introduced into the expression 
5 vector by 5 conventional procedures. 

Accordingly, the present invention also relates to recombinant 
constructs comprising one cr more of the present polynucleotide 
sequences. Suitable constructs include, for example, vectors, such 
as a plasnid. phagemid. or viral vector, into which a sptiptp , -h^*- 

10 encodes an ADAMTS-N protein or an ADAMTS-R1 protein has been 

inserted. In the expression vector, the DNA sequence which encodes 
the ADAMT5-N protein is operatively linked to an expression control 
sequence, i.e., a promoter, which directs mRKA synthesis. 
Representative examples of such promoters, include the LTR or SV4 0 

15 promoter, the E. coli lac cr trp, the phage lambda PL promoter and 
other promoters known to control expression of genes in prckaryotic 
or eukaryotic cells or in viruses. The promoter may also he the 
natural promoter of the ATAMTS-N encoding sequence. The expression 
vector, preferably, also contains a riboscme binding site for 

20 translation initiation and a transcription terminator. Preferably, 
the recombinant expression vectors also include an origin cf 
replication and a selectable '-.arker , such as for example, : r.e 
ampicillm resistance gene cf =; . col i to per~.it solr-ction of. 
transformed cells, i.e. cells that are expressing the heterologous 

25 DNA sequences. The polynucleotide sequence encoding the AT.AJY7S - N 
crcte::: is incorporated into the vector in frame with translation 
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,155 5) Molecular Cloning A Laboratory Manual, Cold Spring Harbor 
Press, Plamview, ;■; . Y . and Ausubel, F . M. et al . (1969) Current 
Prccocols in Molecular Biology, John Wile & Sons, New York, NY. 

Polynucleotides encoding an ADAMTS-N cr ADAMTS -Rl protein rr.ay 
5 also be used for diagnostic purposes. The polynucleotides nay be 
used to detect and quantify ATAMTS-N or ADAMTS - Rl gene transcripts in 
biopsied tissues in which enhanced expression or reduced expression 
of the corresponding ADAMTS-N or ADAMTS -Rl gene is correlated with a 
disease. The diagnostic assay nay be used to determine whether 

10 expression is absent, present, or altered and to determine whether 
certain therapeutic agents m.odulate expression of the corresponding 
ADAMTS-N or ADAMTS -Rl gene. 

Also encompassed by the present invention, are single stranded 
polynucleotides, hereinafter referred to as antiser.se 

15 polynucleotides, having sequences which are complementary to the DNA 
and RNA sequences which encode the ADAMTS-N or ADAMTS -Rl proteins. 
The tern complementary as used herein refers to the natural binding 
of the polynucleotides under permissive salt and 5 temperature 
conditions by base pairing. 

2 0 The present invention also encompasses oligonucleotides that 

are used as primers ir. polymerase chair, reaction ;'?CR technologies 
to amplify transcripts of the genes which encode the ADAMTS -N and 
ADAMTS?. -1 proteins cr portions cf euoh transcripts. Preferably, the 
primers comprise 1 S - 3 0 nucleotides, mere preferably 19-25 

2 5 nucleotides. Preferably, the primers have a G-C content cf 40% cr 

greater. Such oligonucleotides are at least S3* complementary with a 
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sense strand or its corresponding antisense strand. Primers which 
are which nave IjC* ccmplexentarity with the antiser.se strand cf a 
double-stranded DKA molecule which encodes an ADAMTS-:J protein have a 
sequence which is identical to a sequence contained within the sense 
5 strand. The identity of primers which are 15 nucleotides in length 
and have full complementarity with a portion of the antisense strand 
of a double -st randed DNA molecule which encodes the ADAMTS-N protein 
is determined using the nucleotide sequences, shown in FIG I - 11 and 
described by the general formula a-b; where a is any integer between 

10 I and the position number of the nucleotide which is located 15 

residues upstream of the 3' end of the sense or antisense strand of 
the cDNA sequences shown in ?IG 1 -11; where b is equal to a+14; and 
where both a and b correspond to che positions of nucleotide residues 
cf the cDNA sequences shown in FIGS 1-11. 

15 The present invention also encompasses oligonucleotides that 

are useful as hybridization probes for for isolating and identifying 
cDNA clones and genomic clones encoding the ADAMTS-N or ADAMTS-R1 
protein or allelic forms thereof. Such hybridisation probes are also 
useful for detecting transcripts of the genes which encode the 

20 ADAKTS-N family proteins or for mapping of the genes which encode the 
ADAXTS-h' proteins Preferably, such oligonucleotides comprise at .east 
"1? n.:r leo: i des . ""ore r. refer ah 1 y at least 2 3 C , most prefer di. ly from 
about 21C to 2 8 C nuciect ides . Such hyor : di zaticr. pieces have a 
sequence which is at least 30* complementary with a sequence 

2 5 contained within the sense strand of a DI, 7 A molecule which encodes an 



\\ O ill 1 HP4 



PCT I SOO-2 1 223 



- 19 - 

Tm of che probe* :o about 20 3 C to 25 °c below Tm . The probes are used 
in Northern assays tc detect transcripts of ADAMTS-N homologous ger.es 
and in Southern assays to detect ADAKTS-N homologous genes. The 
identity of probes which are 2 00 nucleotides 5 in length and have 
5 full complementarity with a portion of the antisense strand of a 
double-stranded DNA molecule which encodes the ADAMTS-N protein is 
determined using the nucleotide sequences sho.s'n in FIG 1 - 1C and 
described by the general formula a-b; where a is any integer between 
I and the oosition number of the nucleotide vnich is located 200 

10 residues upstream cf the 3' end of the sense r>r antisense strand of 
the cDNA sequences shown in FIG 1 -10; b is equal to a +2 00; and 
where both a and b correspond to the positions of nucleotide residues 
of the cDNA sequences shown in FIG 1-10. 

Such probes or primers are also useful for identifying tissues 

15 ar cells in which the corresponding ADAMTS-N or ADAKTS-P1 gene is 

preferentially expressed either constitutive!"/ or at particular state 
of tissue differentiation or development or in disease states. 
Expression of the ADAMTS-N or ADAKTS-R1 gene in a particular tissue 
or group of cells is determined using conventional procedures 

20 including, but not limited to, Northern analysis, in situ 
hybridization tc F.N A or RT-FCR amplification. Isolated 
pclynuc:-ttides encoding an ATAMTS-N or ADAMTS-?.: protein are also 
useful as chrcmcstm.e rrar"-:er? to mar linked gene per: -lens to 
identify chromosomal aberrations such as translocations, inversions 

2 5 and trisomies, tc compare with endogenous DNA sequences in patients 
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and chroTiosorr.es, PCR, and allele specific hybr idi :a: ion . 
Ant ibodi es 

In another aspect, the present invention relates to antibodies 
which are specific for and bind tc the AD AX 7 S - 5 protein, the ADAMTS - 6 
5 protein, the ADAKTS-7 protein, the ADAMTS-8 protein, the ADAMTS-9 
protein, the ABAXTS-10 protein, or the ADAMTS -Rl protein. Such 
antibodies are useful research tools for identifying *tissues that 
contain elevated levels of the respective protein and for purifying 
the respective protein frorr. cell or tissue extracts, medium of 

1C cultured cells, or partially purified preparations of intracellular 
and extracellular proteins by affinity chromatography. Such 
antibodies are also useful for identifying and diagnosing diseases 
associated with elevated or reduced levels of an ADAMTS -N protein or 
ATjAMTS-RI protein. Such antibodies are also useful for monitoring 

15 the effect of therapeutic agents on the synthesis and secretion of 
ADAKTS-N proteins by cells in vitro and in vivo. Such antibodies xay 
also be employed in procedures, such as co-immuncprecipitation and 
co-affinity chromatography, for identifying other proteins, 
activators and inhibitors which bind to an ADAMTS -N or ADAMTS -Rl 

2C protein. 

7he present invention also provides a method for detecting an 
ADAMTS-N cr ADAMTS - ? l protein, in a bodily sample :rcr. a patient 
using antibodies irr.ur.ocpec i f : c for sn ADAMTS -K cr ATAXTS -?1 protein . 
The method comprises contacting the antibody with a sarrple taken from 
2 5 the patient; and assaying for the formation of a complex between the 
antibody and the corresponding ADAMTS or ADAMTS -Rl protein present 
in the sarple . The sample "".ay he a tissue or a biclcgical fluid, 
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tissue, cells obtained from swabs and s-.ears. To monitor changes in 
expression of the ADAMTS-K protein during fetal development and 
pregnancy, it is preferred that the sample be amniotic fluid. To 
monitor changes in expression of the ADAMT5 -N protein during joint 
5 disorders, the preferred sample is synovial fluid. To monitor 
changes in expression cf ADAMTS-N proteins during cancer, the 
preferred samples include, but are not United to, serum, body 
fluids, or biopsy tissue. To monitor changes in expression cf 
ADAPTS -N proteins durinq inflammation the creferred samples include, 

10 but are not limited to, serum, body fluids, or biopsy tissue. 

The sample may be untreated, or sub3ected to precipitation; 
fractionation, separation, or purification before combining with the 
anti -ADAMT3-K protein antibody. For ease of detection, it is 

preferred that isolated proteins from the sample be attached to 

15 a substrate such as. a column, plastic dish, matrix, or membrane, 
preferably nitrocellulose. Preferably, the detection method employs 
an enzyme- linked immunosorbent assay ( EL ISA) or a Western im.T.unoblot 
procedure . 

Interactions between an ADAMTS - N protein in the sample and the 
20 corresponding anti ADAMTS-N antibody are detected by radiometric, 
cclonmetric cr f luorcmet r i~ means, 5i:e separation, cr 
rrecipitat :cn. Preferably, detection of the ant i body - AT AXTS - 
:• r o rem crmrlex : s bv ado i t ion cf a secondary antibody that i s 
coupled to a detectable tag, such as for example, an enzyme, 
25 i iuoronhore , or chrcmophore . Formation cf the complex is indicative 
cf the presence of the ATAMTS protein i n the test sample . Thus, 
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Cev:a:icr. between ccr.:roi and test values establishes the parameters 
for diagnosing the disease. 

Freparing the ADAMTS-K proteins and the ADAMTS-P 1 prote in 

The ADANTS-N proteins and the ADAMT-SR1 protein rr.ay be produced 
5 by conventional peptide synthesizers,. The ADAMTS-N proteins and the 
ADAXTS-R1 protein nay also be produced using cell- free 
translationsystems and FJJA molecules derived from DMA constructs that 
encode an ADAKTS-N protein or an ADAXTS-RI protein. Alternatively, 
A1\AYT?-N nrnrpir<; srp tisHp by r.ransf ecting host cells with expression 

10 vectors that comprise a DKA sequence that encodes the respective 
ADAKTS-N protein and then inducing expression cf the protein in the 
host. cells. For recombinant production, recombinant constructs 
comprising one or more cf the sequences which encode the ADAMT3-N 
protein or a variant thereof are introduced intc host cells by 

15 conventional methods such as calcium phosphate t rans feet ion , 'ZEAE - 
dextran mediated trans fee tion , transvect ion , microinjection, cationic 
lipid-mediated transf ect ion , electroporation, transduction, scrape 
lading, ballistic introduction or mferticn. 

The ADAMTS-N protein and the ADAMTS-Rl protein may be expressed 

20 in suitable host cells, such as for example, mammalian cells, yeast, 
bacteria, insect cells or ether cells jr.der the control, cf 
ace rem ate cremate re us i ng convent ion =t I techniques. Suitable bests 
include, but are not limited to. 2. coll, ?. pas tens, Tec cells and 
233. HEK cells. Following transformation cf the suitable host strain 

25 and growth of tne host strain to an appropriate cell density, the 
cells are harvested by centri fugat icn , disrupted by physical cr 
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cell pellets or from, cell culture medium, followed by salting-cut, 
and one cr more chromatography steps, including aqueous ion exchange 
chromatography, size exclusion chromatography steps, and high 
performance liquid chromatography (KPLC) , and affinity chromatography 
5 may be used to isolate the recombinant ADAMTS -N protein or ADAMTS Rl 
protein 

Preparation of Antibodies 

The ADAMTS proteins, and variants thereof are used as 

immunogens to produce antibodies inmunospeci f ic for one or more 

10 ADAMTS -N protein. The term n immunospe cif ic" means the antibodies 

have substantially greater affinity for one or more ADAMTS -N protein 

than for other proteins. Such antibodies may include, but are not 

limited to, polyclonal, monoclonal, chimeric, single chain, and Fab 

f ragmen 1 3 . 

15 Antibodies are also prepared using an oligopeptide having a 

sequence which is identical to a portion of the amino acid sequence 
of an ADAMTS -N protein. Preferably the oligopeptide has an amino 
acid sequence of at least five amino acids, and more preferably, at 
least 10 amino acids that are identical to a portion of the amino 

20 acid sequence of an ADAMTS -N protein. Such peptides are 

conventionally fused with those of another protein such as keyhole 
limpet nemccyani r. and antibody produced against tne cnimeric 
"::ec.;le . One p refer: ed oligopeptide f or preparing an antic r. dy 1 1: 
mouse ADAMTS - 5 lias tne sequence , C. HI K VRU F KA C TR F , SED. 12 I.'D : i C . 

25 Another preferred oligopeptide for preparing an antibody to ADAMTS - 5 
is CEAKL'GYQSDAXGYKT? VE>~/PXYAG , SEQ ID i:C : 3 1. One preferred 
cl i gcpept ide for preparing an antibody to ADAMTS - 6 has the sequence 
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preparing an antibody :c ADA.MTS-S has the sequence 

CVK£D7EN?KAYY0GDV,'GF, SEQ ID NO : 2 5 . One preferred oligopeptide for 
preparing an antibody tc ADAMTS-9 has the sequence 

QHPFQNEDYRPRSASPSRTH, SEQ ID NO: 26. Another preferred oligopeptide 
5 for preparing an antibody to ADAMTS - 9 has the sequence 

PQKCKEVKRLKGASEDGEYF, SEQ ID NO: 27. One preferred oligopeptide for 
preparing an antibody for ADAMTS - RL. has the sequence QELEEGAAVSEEPS , 
SEQ ID NO: 28. Another preferred oligopeptide for preparing an 
q^-hnriy for AD A**1" r \" - R 1 has the sequence Y YPENI KPKPKLOS ; SEO ID NO : 2 9 . 

10 Polyclonal antibodies are generated using conventional 

techniques by administering the ADAMTS-N protein or achimeric 
molecule to a host animal. Depending on the host species, various 
adjuvants nay be used to increase immunological response, Among 
adjuvants used in humans, Bacilli Calmette -Guerin (BCG) , and 

15 Corynebacterium parvum. are especially preferable. Conventional 
protocols are also used tc collect blood from the immunized animals 
and to isolate the serum and or the IgG fraction from the blood. 

For preparation cf monoclonal antibodies, conventional 
hybridoma techniques are used. Such antibodies are produced by 

2C continuous ceil lines in culture. Suitable techniques for preparing 
monoclonal antibodies include , but are not limited tc, the hybridoma 
techni cue , the hum.an B-ceil hycri ctma technique, and the E5V 
r.vbridcma technique. 

Various immunoassays may be used for screening to identify 

25 antibodies having the desired specificity. These include protocols 
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prcteiri or an A2A>:ts-r: prctein may be synthesized ir. whc^e or 

in part: us.ng chemical methods. Polynucleotides which encode an 

ADAPTS -N protein, particularly alleles cf the genes which encode the 

ADAXTS-N protein, may be obtained by screening a genomic library or 

5 cDNA library with a probe comprising sequences identical or 

complementary to the sequences shown in Figures 1 - 10 or with 

antibodies immunospecif ic for a AEAMTS-N protein to identify clones 

containing such polynucleotide. 

Example 1 ADAMTS-512 protein 
10 A cDMA encoding mouse ADAX7S-5 prctein was obtained using IMAGE 

Clone 563515, purchased from Research Genetics, Huntsville, Alabama 

and 7 day old mouse embryo cDNA library from Clontech, Palo Alto, 

CA. A cCNA encoding human ADAMTS-S protein was obtained using IFLAGE 

Clone 345484 purchased from Research Genetics, Huntsville, Alabama 

15 and a human fetal brain cDNA from Clontech. The clone inserts were 
sequenced in their entirety. Using oligonucleotide primers based on 
the sequences at the ends of the. clone inserts as template, 
successive rounds of RACE (Rapid Amplification of cDNA Ends) by FCR 
was performed at 5' and 3 ends. RACE primers were generated 50-2CC 

2 0 bp from the ends cf the sequences so that the contiguity of RACE 
clones with the I . K . A. . G . E . clone could be clearly established. A 
single round cf 5' and 3' 20 RAC- sufficed for cloning cf the entire 
ceding sequence cf the reuse ArAttTi-S prctein and part of the 
catalytic zinc binding site through to the step cod on of the human 

2 5 ADAPTS- 5 prctein. Prim.ers were designed with calculated Tm>72 ; C and 
RACE was performed with nested pn~ers for each amplification. FCR 
used the Advantage PCR reagents Clo-ntech, Palo Alto, CA) ; the 
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renditions; 95 "C for I minute followed by 5 cycles of 95 C C for 0.5 
minutes, 72 =C for 5 minutes, then 5 cycles of S5 C C fcr C.5 minutes, 
7Q C C fcr 5 minutes and 20 cycles cf 95'C for 0.5 minutes, 6c -C fcr 5 
minutes. The PCR products were analyzed by Southern blotting, 
5 initially using [a' 2 ?j -dCTP labeled. 

Hybridizing bands were ligated into pGSM-T Easy ;Fromega, 
Madison, V.'I) and individual clones were selected by another round of 
Southern analysis. Automated nucleotide sequencing of both strands 
or cadi clone, were done at the r!c»cCu*di 3iu"_c , wlii.ul^yj'' Cui c yf Lhc 

10 Lerner Research Institute, Cleveland Clinic Foundation and nucleotide 
sequence data were analyzed using the DNAStar software. By- 
integration of the overlapping sequences thus obtained, a contiguous 
nucleotide sequence was determined. The nucleotide sequence of the 
mouse ADAMTS-5 cDNA and the predicted amino acid sequence cf the 

15 protein encoded by this cDNA are shown in Fig. 1. The nucleotide 
sequence cf the human ADAMTS-5 cDNA and the predicted partial amino 
acid sequence of the protein encoded by this cDNA are shown, in Fig, 
2 . 

The predicted m.ciecular mass (Mr) of the mature ADAMTS-5 
20 protein is 73~17.50 daltons. It is expected that the actual Mr of 

the active ATAMTS-5 prot ei n is different due to post - L r a n s 1 a L i on a 1 

domd in organ i zaticr. cf ADAVT.3 5 prote i n rel at i ve t r. the c 1 c n e d c7VA 
is shewn in Figure 12. The pro-domain of the full-length mouse 
25 ADAMTS-5 protein has 3 consensus cleavage signals fcr furm. The 
most carbcxyl - termma 1 furm cleavage site m ATAVTS - 5 predicts the 



WO ill 1 in~4 



PCT L Suit. 21223 



while three residues are downstream, ar. arrangement that is shared 
with other ADAXT3 members. The zinc binding signature is followed by 
a "Met - turn" . The catalytic detain is followed by a domain with 35* 
similarity to snake venom dis mtegrins . The dismtegrm detain 
5 contains eight cysteine residues. The first TS repeat contains 52 
residues and is followed by a conserved cys teine - rich sequence termed 
the cysteine-rich domain, designated "CRD", to distinguish it from 
the cysteine-f ree spacer domain. The CRE contains ten conserved 
c ,; 5t?ines ?_r_a demonstrates hi^h secruence h_o fT% olo r,v wi ri the c* 1 " 5 

1C other AEAMTS-N proteins. The spacer dona in of mouse ADAMTS-5 is 15 3 
amine acids in length and is followed by a second TS module. ADAMTS- 
5 contains three potential glycosy 1 at ion sites in the mature protease 
one of which is just upstream of the start of the spacer domain and 
the second lies within the spacer domain and the third is near the 

15 start of the dismtegrm domain. The human ADAMTS-5 protein and the 
mouse ADAMTS-5 protein have 96% sequence identity. ADA.YT3-5 bears 
46% sequence identity tc ADAMTS-4 1KIAAC586! , which is characterized 
as being involved in catabolism of aggrecan core protein in arthritis 
and 60% identity to ADAKTS-1 which is involved in inflammation. 

2 3 Example 2 ADAMTS - 6 

The nucleotide sequence of a human cDNA encoding the full- 
length AEAMTS-6 protein was obtained using IMAGE clone 74 2 6 3.0, which 
encodes EST AA4 003 92 , and a human fetal cram cDNA from Clont ten. 
RACE was performed as described above in Example 1. The l.M.A.G.E. 

25 clone 74253C contained an CR? fiankeu by consensus splice sequences, 
indicating the preser.ee of intrens. Two successive rounds of PACE at 
tne 5' end and a single round of RACE at tr.e 3' e:.: provided the 



\Y0 0Mlir4 



IK Y I SOO 21:23 



-23- 

The nucleotide sequence cf the ADAXTS - 6 DNA is shora in Fig. 3 
The predicted amino acid sequence,. EEQ ID NO ; 6 , of the ADAMTS-6 
protein is also shown in Fig. 3. The predicted Mr of the full- 
length, unprocessed ADAMTS-6 protein is 97,115 daltons. , and the 
5 predicted Mr of the mature ADAXTS-6 protein is 68412.10 daltons. The 
domain organization of the ADAMTS-6 protein is shewn in Fig. 12. The 
pro-domain of the full-length ADAMTS-6 protein has one consensus 
cleavage signal for furin. The catalytic domain cf the ADAMTS-S 
cent?. ins ?iy cy<?te i^e rp^^npR ^nd fhp rpprnlvnin -zinr b'lr.dina 

10 signature sequence, HE I VTtNFGXKHD , which is followed by a "Met-cum". 
The catalytic domain is followed by a detain with 35% similarity to 
snake verier: disintegrins . The disintegrin domain contains 

eight cysteine residues. The first TS repeat contains 52 residues 
and is followed by a conserve CRD sequence which contains ten 

15 conserved cysteines and demonstrates high sequence homology with the 
CRD of other ADAPTS proteins. The spacer domain of ADAXTS-6 is 127 
amino acids in length and is followed by a second TS module . ADAMTS - 
6 contains four potential glycosylat ion sites within the pyo- domain 
and two in the mature protease one of which is in the cysteine rich 

2D domain and the ether of which is in the spacer dom.ain. ADAMTS-6 

bears 4 6* sequence identity to AD AX T 3 - 1 , which is involved in 

ir.f 1 ammat 1 ::. . 

1X3FT I e ADAMTS - "> . 

The nucleotide sequence of a cDKA encoding an AD-AXTS - 7 protein 

25 was obtained using IMAGE clone 272CS9, which encodes EST t;4.S032, and 

a human fetal brain cDMA frcm Clcntech. RACE was performed as 

described above in Example I. The :.M.A.G.E. clone 2 7 2 0 9 8 encoded a 
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nethiomne codon lies within a satisfactory Kozak consensus for 
translation initiation. 

Tne nucleotide sequence cf the ADAMTS-7 cDN'A is shewn in Fig. 
4. The predicted amino acid sequence, SEQ ID NO: 8, of the ADAMTS-7 
5 protein is also shown in rig. 4. The predicted Mr of the hill- 
length, unprocessed ADAMTS-7 protein is 116,607 daltons, and the 
predicted of the T.ature ADAMTS-7 protein is 840C5 daltons. The 
domain organization of the ADAMTS-7 protein is shewn in Fig. 12. The 

10 cleavage signal fcr furin. The catalytic domain cf the ADAMTS-7 
protein contains eight cysteine residues and the reprolysin- zinc 
binding signature sequence, HELGHSFGIQHD, which is followed by a 
"Met-tum". The catalytic domain is followed by a domain with 3C% 
similarity to snake venom disintegrins The disintecnn domain 

15 contains eight cysteine residues. The first TS repeat contains 52 
residues and is followed by a conserved CRD sequence which contains 
ten conserved cysteines. The spacer domain of ADAMTS-7 is 221 amino 
acids in length and is followed by a second TS module and a short 
sequence containing two cysteine residues. ADAMTS-7 contains three 

2 0 potential glycosy lat ion sites within the mature protease; one of 
which is just upstream of tne spacer domain and one cf which is 
w i t h i n the spacer do m a i n . AD AMDS - ? bears 35 * sequence identity t o 
ADAMTS-1. which is characterized as being invclved in mf ia~mat icr. 
and 32% identity to ADAMTS - 2 which is a procollagen processing 

2 5 enzyme. 

F y ample 4: ADAMTS - 8 
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prctem was obtained using IMAGE clone 211SS38, which encodes EST 
A14CO905, and a human fetal bram cDKA library from Clcntech. RACE 
was performed, as described above in Sxar.pl e 1. The nucleotide 
sequence of the cDNA encoding the full-length ADAMT3-8 mouse protein 
5 and the amino acid sequence of such protein is shown in Fig. 5. The 
nucleotide sequence of the cDNA encoding the partial ADAMTS-3 human 
protein and the amino acid sequence of such protein is shown in Fig. 
6 . 

The nredicted Mr of the full-length, unprocessed APAMTS-? TlOu?*? 

10 protein is 12606S3 daltons, and the predicted Kr of the mature 

ADAXTS-8 protein is 63412. 1C daltons. The pro domain of the full- 
length ATjAMTS-3 protein has one consensus cleavage signal for fur in. 
The catalytic domain contains eight cysteine residues and the 
reprolysm-z inc binding signature sequence, HELGHVLSMPHD , which is 

15 followed by a "Met-turn". The catalytic domain is followed by a 
domain with 2C-30% similarity to snake venom dis integrins . The 
disintegr in- 1 ike domain contains eight cysteine residues. The first 
TS repeat is followed by a conserved CRD sequence which contains 10 
conserved cysteines. The spacer domain of ADAMTS-8 is 146 amino 

2C acids in length and is followed by a second TS nodule. The ADAXTS-8 
prctem contains 4 potential glyccsy 1 at icn sites within the mature 
protease : one is m the cysteine -rich cer air.; one is in the 
r n t t i 1 y t ; c :7.c->a i n ; and two are in '. ne d i s i n t e g r i r. - 1 : ke coram . 
ATAKTS-8 bears 46% sequence identity tc ADAPTS-! and 42% identity tc 

2 5 ATJAKTS-4 . 

Example 5: A2AJ-:TS-_9 

"The nucleotide sequence of a c~KA encoding a fuli-lengtn, nur.ar. 
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proteir. was obtained using IMAGE clone 535t63, which er.ccdes EST AAl 
062 15, and a mouse cDNA library obtained from Clonetech. RACE was 
performed as described above in Example 1. The nucleotide sequence 
of the cDNA encoding the full-length ADA.YTS-9 human protemand the 
5 ar.ino acid sequence of such protein is shown in Fig. 5. The 

nucleotide sequence of the cDKA encoding the partial ADAMTS - 9 mouse 
protein and the anino acid sequence of such protein is shown in Fig. 
7 . 

Tine predicted Mr of the mature human ADAMTS-S protein is 
10 1&S777.20 daltons. The prodomain of the predicted ATJAMTS - 9 protein 
has 3 consensus cleavage signal for furin. The catalytic domain of 
the ADAPTS -9 contains eight cysteine residues and the reprolysm - 
zir.: binding signature sequence, HELGHVFNMPHD, which is followed by a 
"Met-curn". The catalytic domain is followed by a domain with 25-30% 
15 similarity to snake venom disintegrins The disintegrin domain 

contains eight cysteine residues. The first TS repeat contains is 
fcl lowed by a conserved CRD sequence wnich. contains 10 conserved 
cysteines. The spacer domain cf ADAMT3 - 9 is 124 amino acids in 
length and is followed by 14 additional TS modules and a C- terminal 
20 dcmain. The AD AM T 3 - 9 protein contains 6 potential glyccsylat ion 
sites within the mature protease: one in the spacer domain, one m 
TSP 1 -7, one in TSPI-9, and 3 m the c- terminal domain. The ADAMTS- 
3 hears 4-1% sequence identity tc AUAMTG-4. 

2 3- The nucleotide sequence cf 3 cDN\ encoding a fail - length 

AIAXT3- 10 prctem was obtained using IMAGE c.cne 11C4.C3, wnicn 
encodes EST AA5834 34, and a human fetal brain cENA from Cionetech. 
The nucleotide sequence cf a cENA encoding a partial, mouse ATA^TS-10 
protein was obtained using IMAGE clone 10~7653. which encodes EST 
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performed as described above in Example 1. The nucleotide sequence 
of the human ADAMTS -10 cDNA and Che predicted amino acid sequence, 
SHQ :D 18, of the human ADAMTS- 10 protein encoded by such DMA is 
shown in Fig. 9. The nucleotide sequence of the cDNA encoding the 
5 partial mouse ADAMTS- 10 protein and the amino acid sequence of such 
protein is shown in Fig. 10. 

The predicted Mr of the mature ADAMTS -10 protein is 952 33 
daltons. The pro-domain of the full-length ADAMTS -10 protein has no 

-^-■•^■-'■-^ - * ^ " ' 3 w " - J" ~ - ------- j 

10 ADAMTS- 10 contains eight cysteine residues and the reprolys in - zinc 
binding signature sequence, HEIGHTFGMNHD, which is followed by a 
"Wet-turn". The catalytic domain is followed by a domain with 3C% 
similarity to snake venom dismtegnns . The dis integrin- like domain 
contains eight cysteine residues. The first TS repeat is followed by 

15 a conserved CRD sequence which contains 3 conserved cysteines. The 
spacer domain of ADAMTS- 10 is followed by 4 additional TS modules and 
a Kur.it z domain. The ADAMTS -10 protein contains 2 potential 
glycosylation sites within the mature protease: one in the catalytic 
domain, and one in the TS 1-3 domain . ADAMTS -10 bears approximately 

20 40% sequence identity to ADAM-TS1, which is characterized as being 
involved in inflammation. 

As shown m Figure 11, the ADAMTS - 5 . ADA.VTS-':, r.r.i ADAMTS- - 
proteins share a common domain organization. From amine to carbcxyl 
25 termini, they are as fellows: 

1. A pre-pro region. A typical signal sequence of variable length 
is followed by a putative pro-region of variable length but 
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context s:r.:lar to -he cysteine "switch" of the KKPs . All three 
r.cvel cDKAs predict consensus cleavage signals for furm, three in 
the case of ADAXTS-5, and one each in the case of ADAMTS - 6 and 
ADAMTS-7. The most carboxyl - terminal furir. cleavage site m ADAMTS -5 
5 predicts the processing site for generation of the mature protease. 
The amino terrr.inus of the mature proteins is predicted to start at 
the residue immediately following the cleavage sites. 

2. A catalytic domain. The catalytic domains are very similar to 
each other <=*nri r~ r ~ r * i r eight cysteine residues and a typical 

10 reprolysin- type zinc binding signature followed by a "Ket-turn" . 
Five cysteine residues are upstream of the zinc binding sequence, 
while three residues are downstream, an arrangement that is shared 
with other ADAMTS members. The methionine of the met -turn is not at 
a constant distance from the zinc-binding signature, but in all three 

15 novel proteases, a constant cysteine residue is present in that 
interval . 

3. A disintegrin-like domain. The catalytic domain is followed by 
a domain of 60-SO residues with 35 -4 5% similarity to sna^.e venom 
dismtegnns , but without the canonical cysteine arrangement seen in 

20 the latter. This disintegrin- like domain is of comparable length in 
ADAMTS -5 and ADAMTS-", it is considerably shorter : r. ADAMTS-6. 

4. A TS module. The f i rst TS repeat is very similar : r. all z'r.i ee 
novel pre teases and very similar to the first T3 repeat of other 
ADAMTS s. It contains the same number of residues (fifty- two' m a_l 

2 5 three novel proteins. 

5. The cys teine -rich domain. This T3 domain is followed by a 
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other dcrr.ai.ns . It shows the least homology of all the domains. 
7. A C- terminal TS module. The sequence cf the second TS m.odule 
is r.ore variant between the members cf the ADAMTS family than the 
first TS module, despite the conservation of the number and spacing 
5 of cysteine residues. 

Overall, the predicted mature forms of these proteases show 2 3- 
3C% similarity to each other and to ADAMTS 1 - 4 although this may be 
considerably higher or lower for individual domains as described 
shnvp . 

10 ADAM - TS 9 and ADAM-TS1C contain all the domains present in 

ADAMTS -5 through ADAMTS- 8. In addition, ADAMTS - 5 and ADAMTS-10 
contain the following domains: 

A. ADAMTS- 9: After the c- terminal TS1 domain which is 
present in ADAMTS 5 - 8 , ADAMTS- 9 contains 13 additional and homologous 

15 TS11 domains, thus, ADAMTS - 9 contains a total of 15 TS1 domains, of 
which 14 are adjacent to each other in the c-terminal half of the 
molecule. The 15th TS1 domain from the N-terninus is followed by a 
unique c-terminal domain which dees not possess recognizable domain 
structure and contains 196 residues including 9 cysteine residues. 

20 B. ADAMTS-10: After the c-terminal TS1 domain which is 

present in ADAMTS 8 , ADAMTS-10 contains 3 additional and homologous 

of wh:ch 4 are adjacent to each other m the c- termi nal half cf the 

molecule. The 5th TS 1 domain from the ::- terminus is followed by an 

~ ?, additional 47 amino acid residues including six ; 6: cysteine 

residues . These 4 '• res ; iues have sequence similarity of 2 C * - 4 0 *• : o 
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from human and ^ouse tissues (Clontech, Palo Alto, CA) were 
hybridised to the [cr : P] -dCT? labeled inserts of I . M A.G.E. clcnes as 
per the manufacturer's recommendations followed by autoradiographic 
exposure for 3-7 days. 
5 In situ hybridization used cryosections of mouse embryos of 

gestational age 8.5 days and 10.5 days. Embryos were collected with 
the inclusion of the surrounding uterus and fixed overnight in 4% 
paraformaldehyde. Sense and anti-sense probes continuously labeled 
with digoxig en in - UTF [ EoeJii. liiyei -Mannheim , Indianapolis, liw were 

10 transcribed with T7 and T3 RNA polymerases, respectively, using as 
template a 63 0 bp EccRI-Sacl fragment from, the Adair,ts-5 clone 5695 15 
(Fig. 14) cloned into pBluescript SK+ (Stratagene, La Jolla, CA) . In 
situ hybridization was done essentially as previousl/ described in 
Apte, et al. (1997) J. Biol. Chem. 272:2551-25517, which is 

15 specifically incorporated herein by reference, except that sections 
were predigested with proteinase K (Ecehringer-Mannheim, 
Indianapolis, IN) at a lower, concentration (1 -5 ^g/ml) than 
reported in Apte, et al . . Bound, digcxigenin- labeled probe was 
detected using an alkaline phosphatase tagged ant; -digoxigenm 

2C antibody (Boehringer-Mannheim, Indianapolis, IN"} and nuclei were 
counters tamed with methyl green. 

Specific hybriuizacier. cf the ant isense Adamrs-5 piece to 
tectior.o cf 3 . 5 day -eld reuse embryos was obtained, whereas only low 
background staining was noted with the control sense probe. Staining 

2 5 was uniform throughout the 8.5 day old embryos. In add! tier., there 
was labeling cf mRNA in t rcphot 1 a s 1 1 c cells lining the uterine cavity 
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day-old eriryc. Labeled cells were seer, in mesenchyme and sor..:es as 
well as in the neural tube ar.d developing hindgut Northern analysis 
also indicated that rnRNA encoding ADAMTS-5 was present in human 
placenta but was barely detectable in adult lung, heart, brain, 
5 liver, skeletal muscle, kidney and pancreas. 

Northern analysis shewed undetectable expression cf Adapts- 6 
during mouse embryo development. Northern analysis indicated that 
mRNA encoding ADAMTS-6 was present in hux.an placenta but was 

b.^rp 1 v dpt-prrsblp i r, adult lurg, he^rt , brain, -■>'<=> >_ -kplptsl 

10 muscle, kidney and pancreas. Ada.-rtzs-l was expressed at low levels 
throughout mouse development. In adult human tissues examined with 
human eDITA probes, ADAMTS-7 mRNA was found in all tissues examined, 
i.e. in lung, heart, brain, liver, skeletal muscle, kidney, pancreas 
and placenta. The sizes of the mHXA species recognized by the probes 

15 varied. ADAMTS-5 mKN'A was approximately 1C kbp in size in human 

tissue. The most prominent Adamts-5 species was estimated at 7.5 kbp 
together with additional bands at 1C kbp and -i 5 kbp. The lone mRNA 
species detected by ADAKTS-6 probe was approximately 3.5 kbp, whereas 
the most common m.Ri:A species detected by ADAMT3-7 probe 5 was 5 kbp 

20 in size with an additional species seen at 7 kbp m skeletal muscle. 

In mouse , AT AM T 5 - 6 is expressed during fetal deve 1 cement 'days 
~, ::, ;5 l? i and in adult mouse lung and heart with an mP.MA size of 
arc roximat elv "5 . 5 k'cp . Tr. adult human t issue , ATJAXTS - S : s expressed 
in lung and brain cut net in heart, muscle, kidney, colon or thymus. 

2 5 The mRNA size is 2.S kbp. 

ADAPTS-? is expressed m iur.s, ovarv placenta, heart, bra:r. . 
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ADAXTS - 1 j is expressed in thymus, prostate, testis, ovary, 

small intestine, colon, peripheral bleed leukocytes, heart, brain, 

placenta, lung, liver, muscle, kidney and pancreas, as well as m 

many cell lines such as A54 9, HeLa and K562. There are two 

5 transcripts of 5 kb and 8kb present in all tissues. 

Example 7: ADAMTS-R1 

The nucleotide sequence of a cDNA encoding a full-length 

ADAMTS-R1 protein was obtained using IMAGE clone 752797 which encodes 

EST AA, and a hurr.an fetal brain cDKA from. Clontech. RACE 'was 

13 performed as described above in Example 1. The nucleotide sequence, 
SEQ ID M0:21, of the ADAMTS-R1 cDNA and the predicted amino acid 
sequence, SEQ ID NO: 22, of the ADAXTS-Rl protein encoded by such DNA 
is shown in Fig. 11. 

The predicted Mr of the full -length, unprocessed ADAKTS-R1 

15 protein is 58358. 2C daltcns. The domain organization cf the ADAMT5- 
10 protein is shown in Fig. 15. In contrast to the ADAMTS-N proteins 
of examples 1-6, ATJAMT5-R1 protein does not have a pro- 
metalloprotease cr disintegrin-like domain or a consensus cleavage 
signal for furin. ADAKTS-R1 has a signal ;pre) peptide which is 

20 followed by a first TS module and a conserved CRD sequence which 

contains 10 conserved cysteines. The spacer domain cf ADAttTS-Rl is 
llr ammo acids in length and is followed by 5 additional TS modules 
and a short sequence of 33 ammo acids . The ATAXTS-Rl prcteir. 
contains one potential clycosylat ion sites which is in the spacer 

25 domain. ADAXTS - Rl bears 3C -40% sequence identity to ADAXTS 1 and 

ADAMT54 m the related domains. ADAXTS -Rl mRIJA is present m human 
heart, bra in , kidney , r.uscle, lung, placenta, testis, ovary, cc.on , 
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Althcuch certain enricai :ner.t s c£ this invent 1 on have been shewn 
and described, various adaptations ana rrodi f i caz i ons can be made 
without departing from the scope cf the invention as defined in the 
appended claims. 
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An isolated mammalian protein selected frorr. the group 
cons-Sting of an ADAMTS -5 protein an ADAMTS -6 protein, an 
ADAMTS -7 protein, an ADAMTS -8 protein, an ADAMTS - 9 protein, an 
ADAMTS-IO protein, and an ADAMTS -Rl protein. 

The isolated marr.-nalian protein of claim 1 wherein said protein 
comprises an amino acid sequence which is at least 95% 
identical to a sequence selected from the group consisting of: 
amino acid 262 through amino acid 930 of SEQ ID NO- 7 ; ^mino 
acid 1 through amino acid 515 of SEQ ID NC:4; amino acid 245 
through amino acid 860 of SEQ ID KC : 6 ; amino acid 23 3 through 
amino acid 997 of SEQ ID NO : 8 ; ammo acid 229 through ammo 
acid 905 of SEQ ID N0:1C; amino acid 1 through amino acid 245 
of SEQ ID NO: 12; amino acid 235 through amino acid 1682 of SEQ 
ID NO: 14; ammo acid 1 through amino acid 674 of SEQ ID NO: IS; 
amino acid 212 through amine acid 1C61 of SEQ ID NO; IB; amino 
acid 1 through amino acid 45C of SEQ ID NO: 20; and amino acid 1 
through amino acid 54 7 of SEQ ID NO: 22. 

The isolated protein of claim 2 wherein said amine acid 
sequence further comprises a prepropept ide sequence at the 

a™:~o terminus the reef. 

Tr.e isolated r rote in of claim 1 wherein said protein is a human. 
ADAMTS— prcte : n cr :i mouse ADAMTS - 5 protein. 

The isolated protein of claim 1 wherein said protein is a human 
ADAMTS -6 protein. 
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ADAM T 3 - 3 or a r.ouse ADAMTS - 9 protein. 
5. The isolated protein of claim : wherein said protein is a human 

ADAMTS -10 or a mouse ADAMTS- 10 protein. 
1C. The isolated protein of claim 1 wherein said protein is a human 
5 ADAMTS -Rl protein. 

11. An isolated polynucleotide comprising a sequence which encodes 
a mammalian protein selected from the group consisting of an 
ADAMTS - 5 protein, an ADAMTS - 6 protein, an ADAMTS - 7 protein, an 
ADAMTS - S piulcui, a 11 ADAMTS - 3 protein, an j-lDAMT^> 10 protein, 

10 and an ADAMTS -Rl protein. 

12. The isolated polynucleotide of claim 11 wherein said protein 
comprises an ammo acid sequence which is at least 55% 
identical to a sequence selected from the group consisting of: 
amino acid 262 through amino acid 930 of SEQ ID NO : 2 ; amine 

15 acid 1 through amino acid 518 of SEQ ID NO •. 4 ; amine acid 245 

through amino acid 86 0 of SEQ ID NO : 6 ; amine acid 23 3 through 
amino acid 997 of SEQ ID NO : B ; ammo acid 22 9 through amino 
acid 905 of SEQ ID NO: 10; amine acid 1 through amino acid 245 
cf SEQ ID KO:12; amino acid 236 through amino acid 1382 of SEQ 

20 ID NO: 14; amine acid 1 through amino acid 874 of SEQ ID NO: 15; 

ammo acid 2 12 through amino acid 1C9 1 cf SEQ ID NO: 15: amino 
acic, 1 through amine acid 4 EC cf SEQ IT h" 2 : 2 :• , 3 r. d a~mc acid 1 

13. The isolated polynucleotide of claim 11 wherein said nucleotide 
25 sequence encodes a prctem having a signal sequence at the 

3T.m:> termmus therecf . 
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nucleotide 1519 of SEC ID NO : 3 or an allelic variant: thereof; 
nucleotide 7=4 through nucleotide 2 602 of SEQ ID NO : 5 cr an 
allelic variant thereof; nucleotide 708 through nucleotide 3033 
cf SEQ ID NO: 7 cr an allelic variant thereof; nucleotide 962 
5 through nucleotide 2992 of SEQ ID NO : 9 or an allelic variant 

thereof; nucleotide 1 through nucleotide 73 9 of SEQ. ID NO: 11 cr 
an allelic variant thereof; nucleotide 70B through nucleotide 
5648 of SEQ ID NO : 1 3 cr an allelic variant thereof; nucleotide 
1 ^>^v-/-,mqv- nucleotide 2625 cf SE n ID NO: 15 or an allelic va.ri'?. r. t 

10 thereof; nucleotide 634 through nucleotide 3243 of SEQ ID NO: 17 

or an allelic variant thereof; nucleotide 1 through nucleotide 
1642 of SEQ ID NO: 19 cr an allelic variant thereof; and 
nucleotide 51 through nucleotide 162 5 of SEQ ID NO : 2 1 cr an 
allelic variant thereof. 

15 15. The isolated polynucleotide of claim 11 wherein said 

polynucleotide hybridizes under stringent conditions to a 
nucleic acid molecule comprising a sequence complementary to 
the protein encoding sequence of SEQ ID NO : 1 ; SEC ID NO : 3 ; SEQ 
ID N0:S; SEQ ID KO : 7 ; SEQ ID NO : 9 ; SEQ ID NO: 11; SEQ ID NO : 1 3 ; 

2C SEQ ID NC:15; SEQ ID NO: 17; SEQ ID NO : 1 9 ; or SEQ ID NO : 2 1 . 

15 . An isolated polynucleotide having a sequence which is 

17. At. expression vector comprising a polynucleotide of claim 11. 
25 15. A host cell transformed cr transfected with an expression 
vector c f ;la.T; I " . 
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prc:e:n; and 

(b' recovering said ADAXTS-N pro rem cr said ADAMTS 
prcte.n from the host cell culture. 

20. An antibody that binds to a protein selected from the group 
consisting of an ADAMTS-5 protein, an ADAMT3-6 protein, an 
ADAMTS-7 protein, an ABAMTS-8 protein, an ADAMTS - 9 protein, an 
ADAMTS -10 protein and an ADAMTS -Rl protein. 

21. An oligopeptide for producing an antibody that binds to an 



10 oligopeptide has a sequence selected from the group consisting 

cf : 

a) SVSIERFVSTLWADK, SEQ ID NO; 23; 

b) EVAEAANFLALRSEDPDKY, SEQ ID NO: 24; 

c) V KED YENP KAVYDGD WG P , SEQ ID NO: 2 5; 
15 d) QHFFQNEDYRPF SAS PSRTK , SEQ ID NO : 2 6 ; 

e) FQNCKEVKRLKGASEDGEYF, SEQ ID NO : 2 7 ; 

f) QELEEGAAVSEEPS, SEQ ID NO : 2 6 ; 

g) YYFENIKFKPKLQE; SEQ ID NO : 2 9 ; 

h) HIKVRQr KAKDQTRF; and 

2C l) CEAKNGYQSDAKGYKTFVEWYPKYAG, SEQ ID NO: 33. 
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Fig. 1 



FEATURES Locaticn/V-jali f iers 

sc-ice 1..30D2 

/ or gani st.= " Kns rrj.s tu lus " 

,'db_xref=" taxon: 1009G* 

/chro:nc5cme= "Mouse 16* 

/rap="58 cM (consensus position) ' 
gene 1..3002 

/r.oces'a dis integrin-liXe arid ir.etalloprstease do:?.air. with 

ihrombospondin type I repeats) " 

/ 3 ene=" Adapts S' 
CT-5 13.. 2810 

/gene= " Adapts 5 " 

'note=' putative secreted netalloprotease' 
1 codon_s tart*l 

'product^ "ADAM-TS5 (a dismtegrin-like and Tr.etallcprotease 
domain with thrsrnbospondin type I repeats) " 

/ translation * MRLEWA5T T i] T'.T T.T SASCLolAAESPAAAPAQDKTRQPCAAAA 
AAEP DQ PQGESTFSFGHLQPIAGQRR SGGLVHN I SQLYSGOGSVGVLVYAQGR^LLD 
U^DITTVGAAGS TVTAGGGL S AS SGHF£KCFYRG7\ T DG S PRSLAVFDLCGGLDG FF AV 
YH&PVTT.VPT.-.Pr. SWAEYERIYGDG SSRI LHVYNREG7SFEAL? PRASCETPAS ? SG? 

^ E 5P SVKSRSKRSSAIAPCLLLHS AF S PSGTiAG pgTWWRRRRRS I SPARQVELLLVAD 

A?Tm,TCIC;i?QHOHNQ* J G*:CKESHVnAAI LFTREDLC GHHSC SfTLGMAIVGT ICSPER 
SCAVI^KI^AAF^AKEIGKL^^^ 

PWSK^TSAT ITEFLDDGHGKCLIjDL PRKQ ILGPEELPGQTITATQC'CIJLTFGFE' f SVC 
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3KSFRr3CC5A:c;cr: wSoa:- .3\r-rv-r.-,Y?KYAGYL PATYCHL7C. . _ . - 

>7,TI^TZC?.PYSr. , S'v'rvT..G?.C".'?>TGCDG I IGrr^YDXCGVCGGI^SSCTK:: IGTFNX 

: c n;G-r.7 2nT sgwsj-irzdft-h gmgysatxe i l iv; : latdpt kalgyr vs ffv fkktt 

QKWSV 1 SK 3 SNJVo PM5 TQL QWVTG PWLAC 5 P.ZCZK&.rTTRTVQZQECjS&XlJj:: G"L 
ISnRPSAITOCLLKrC" 
BASE GG177T 725 a 73& c £45 g £45 t 

crigi:; 

1 ccggcgggca gcgcactatg cggctcgagt gggcgrccct, gt.gcrgcta ccgrtgctgc 
£1 tgagcgcgtc ctgcccct.cc ctggccgctg acagccccgc cgcggcaccc gcc=agga::a 
121 aaaccaggca gcctcaggcc gcagcagcgg ccgccgagcc ggaccagccg cagggggagg 
181 aaacacggga gccaggccat ttacaaccct tggccgggca gcgcaggagc ggcgggcigg 
241 tccatzaatat agaccaactc tartctggcg gtggcaaagt gggctacctt gtctacgcgg 
331 gcggccggag qttcctgctg gacctggaga gagatgacac agtgggtgct gctggtagca 
351 tcgttactgc aggaggaggg ctgagcgcac cctccggcca ccggggtcac tgtttctaca 
421 gaggcaccgt ggacggcagc cctcgacccz tagccgtctt tgacctctgc gggggizctcg 
491 atggcttcr- cgcagtcaag catgcgcgct aca~tctiaa gccactcctg cgtgggtcct 
541 gggcacagca tgaacgaat:*: tatggggacg gatcttcccg catcctgcat gtcLacaacz 
6C1 gcgagggctt tagcttcgag gcrctgccgc casgcgccag ttgcgagact cczgcatccc 
£61 catcrgggcc ccaagagagc ccctcggtgc acagtagatc taggagacgc tcagcgctgg 
721 ccccgcagct gctggaccac tcagrt-tct cgccatctgg gaacgcggga cctcagactc 
761 yyL'jjgagg-g taggcgccgt :c:acc;:ca gggcccgcca ggcggagctc ctcttggtgg 
841 ctgactcgtc catggccagg acgtatgggc ggggcctgca gca^tacctg ctgacrcrgg 
9C1 cccccatcgc caacaggccg tacagtcatg caagcattga gaaccacacc cgcctggcgg 
961 -ggtgaaggt ggtggtgccg acggacaagg acacgagtct ggaggcgag= aagaacgrgg 
1021 ccacgaccct caagaactct tg:aaatggc agcaccaaca taaccagcta ggggatgatc 
1C31 acgaagagca ctacgatgca gccatcccgt ccacccgaga ggatttatgc gggcatcatL 
1141 catgtgacac cctgggaatg gcagacgttg ggaccatatg ttctccggag cgcagrtgtg 
12C1 cagtgattga agatgatggr ct:catgcag ccttcactgt ggctcatgaa attgggcatc 
1261 tacttggccc ttctcatga:: gattrcaaat tctgtgaaga gaacttcggt actacagaag 
1j21 acaac:gtr.t aatgccttca atrcttazca gcatcgatgc atccaagccc tggtcraaa; 
1381 gcacctcagc caccatcaca gaattcctig g atgatggtca tggtaattgc ttgciagacc 
1441 tacca-ggaa gcagattttg ggtcrcgagg aactcccagg acagacctac gatgcra=c= 
1501 agcagtgnaa cttgazattt gggcctgagt: acc.=ggtgtg ccctggcatg gatgt:tgtg 
1561 cgcggrtgtg gtgtgitgtg gtjcgccaag gccaaacggt gtgtctgacc aagaagctgc 
1021 cggctgtgga gggca=t==c tgrgggaagg gaagagtctg ccttcaaggc aaatgtgtgg 
1*E1 acaaaactaa gaaaaaatat ta:tcgacat caagccatgg aaattggggg ticcgggccr 
l"4i cctgggg^ca gtgtt-t:gc tcatgcgggg caggagtgca gtttg~ctac cgccattgta 
1E-C1 ataacictgc acctcgaiac agrggccgct actgcacagg gaagagggcc atataccgtt 
1£61 cctgcagtgt tacacrctgc ccacccaatg gtaaatcntt tcgccatgag cagtg-:gaag 
li-21 ccaaaaatgg ctatcagtc- gatg?aaaag gagtcaaaac atttgcagaa t rggCzrcca 
ir*31 aatatgcagg tgtcctgrcg gcragatgtg*: gcaagct^ac ccgcagagct aagggracag 
2 C 41 gctaciatgr cgt-cttrct ccaaaggtta cggatgggac tgaatgcagg crgtaragca 
2101 actctgtgtg tgt=cgajga cgg-gtgtga gaactggatg tgacggcait a:^gg:tcaa 
2161 agctacaata tgacaagtgt ggagtgtgcg gaggggataa ctccagttgt acaaagatta 
2221 tcggaacctt caataaaaaa agcaagggtt atactgacgt tgtcaggatc C':tgaaggag 
i:Sl caacccacs: aaaag:c;ga cagitcaaag ccaaagacca gactagattc ccigczzact 
I'.Al *: age c> :tgaa g^aca.ia'rt ggegagtae- ttatcaargg csagraratg a;:c:;a:tt: 
14j1 cagag.icrat :a:cgaci::; aatggtaccg t=atgaacta csg^ggacjg a:cca-;agag 
2^ r ;l at-gafitct": aratggg^tg ggctattca^ c^scaaaags aatcczgatc g'gcajatcz 
1J21 r.tgrc.i::aga cccaactiaa gcgctaggcg tzcgttacar cif.zzzjtz c::aa:aaca 
2L?1 L-cact:aaaa agtaaactc: 1 : g^catcagcc aiggcagcaa caaegtrgea ::ica::::a 
-acag^L^.a gtgggig^ca ggtccacggc tggcczgctc: caggacrtg- gica^dggjt 
2" 31 ggca-actag gaeegtgrag tgccacgatg giaacagcaa attagctaaa gjatgzcttc 
2"51 tctzt'zagag gccitczzza t^taagcaac gtc-gctgaa gaastigctag cctgtggtLt 
2(21 actctaatgc acaaaaaiac aacaggagga tcat_gcaga tacagctgtg gtgaagacaa 
2r31 ggcctaccca aagracagaa agrcitgezt tcatgt-art gtzs.cca.cga. gtcgaatiat 
2^41 gggcagaa-c zgct--,~zgz gaccaaaagg tttficrcta; tiggtgaatg atrgtaccg*: 

:::: g* 
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FEATLrmS Lccation/Quali f iers 

source 1 . . 1520 

/crganisrn= "Hcrno sapiens ' 

/db_xref= ■ ravr— ■ ?(? ~>6 " 

J chromes cme= "21" 
BASE COU>r? 416 a 372 c 376 g 352 t 4 others 

ORIGIN 

1 ggacattt-ac ttggcctctc ccatgacga- tccaaattct gtgaagagac ctttggttcc 
^61 acagaagata agegcttaat gccttccatc cttaccagca ttgatgeate taagccctgg 
121 tecaaatgea ctccagecac catcacagaa ttcctggatg atggccatgg taactgtttg 
181 ccggacctac cacgaaagca gatcctgggc cccgaagaac tcccaggaca gacccacgac 
241 gccacccagc agtgcaacct gaeatteggg cctgagtact ccgtgtgtcc cggcanggai 
3 01 gtccgtgctc gcctgtggtg tgctgtgg-a cgccaggg=c agatggtctg tctgaccaag 
351 gagtgcagtt tgcctatcgt cactgtaata accctgcccc cagaaacaac ggaegctact 
421 gcacagggaa gagggecate taccactcct gcagtctcat gccctgccca cccaa-ggta 
481 a&tcatttcg tcatgaacag tgtgaggcca aaaatggcca tcagtctgac gcaaaaggag 
541 teaaaactrt tgtggaatgg gttcccaaat atgcaggtgt cctgccagcg gatgtgtgca 
601 agctgacc-g cagagecaag ggcactggct accatgtggc attttctcca aaggtgaccg 
661 atggcactga atgtaggccg tacagtaatt ccgtctgcgt ccgggggaag tgtgtgagaa 
721 ctggctgrga eggcatcatt ggctcaaacc tgcagtatga caagtgcgga gtatgtggag 
7 31 gagacaaccc eager gtaca aagattgctg gaacctttaa taagaaaagt aagggctaca 
Bil ctgacgtggt gaggattcct gaaggggcaa cccacataaa agttcgacag ttcaaageca 
501 aagaccagac tagactcact gecratttag ccctgaaaaa gaaaaacggn gagta=ctta 
961 tcaatggaaa gtacatgatc tccacttcag agactatcat tgacatcaat ggaacagtca 
1C21 tcaactatag cggttggagc cacagggatg acttcctgca tggcatgggc tactctgcca 
1051 cgaaggaaat tctaatagtg cagattctrg caacagaccc cactaaacca ttagatgtc? 
1141 gtracagctt ttttgctccc aagaagtcca ctccaaaagt aaactctgtc actag-catg 
12C1 gcagcaataa agtgggatca cacacttcgc agscgcagtg ggtcaeggge ccatggcticg 
1261 cctgctctag gacctgtgac acaggttgee acaccagaac ggtgcagtgc caggatggaa 
1-21 aceggaagt- agcaaaagga tgrr=-c-=t cccaaaggcr ttc.-cg-t:: aagcaa-=~t 
^91 tgt-gaagaa a-.gttagcct g^gggttatg ate-attge acaaaagata ctggagga-t 
car.~accc = c ccaaCcrjijg tgaacaggaa eg— a~~tta acgra = agaa arccaigc^ 
taa-gaca" gtzraacragg 
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FEATURES Location./ Quali fiers 

source I.. 28-16 



/crganism= "Homo sapiens" 
/db_xre£ = ' taxcn: 9SC6" 
/chron\osc:ne='' 5 " 
1. .2B48 

,'note=" A Disintegrin-like Ar.d Me tallcpro tease dcmain 
with ThroTJscSpor.din type I notifs 6' 
, gene= ' ACAMTS6 " 
22. .2602 

/note="zir.c me tallcpro tease' 
; ~ odon_s tart= 1 

/products' a Disintegrin-like And Ketallcprotease dor-Air. 

with ThrcmboSpcndin type I motifs-6 (ADAM-TS6) • 

;■ trails laticn= ' >EI LWKTLTWIL SL IMAS SEFHSDKRLSYS SQEEFLTYLEZ-rVC. L 

TI P IR\OC'NGAFL SFIVKT XKKSRRRRSMDP I CPQQAVSK1JTK1SAYGKHFHLNLTL 

K~DFVSKHFTVTYV^<I>GPC^^ 

EZEEYT 1 E?LKNTTEDSK?T S'fET'TOHPHVnTySAl^SHLYDHSHCOTSrFTF-SGK? 
WWLIJDTTSTVSYSLPIMJTKIHKRQXRSVS I EJIPVETLVVADKffiiVCJYHGSKD I EKY I L 
SVMN IVAXLYRDSSLGNVVNI IVAKL IVLTEDt P?^IMiADK£U)SFCKl^KSILS 
HC SDGTJT I PENGIAKHENAVL I TRYP 1 GTYKNK PCGTLGLASVAGKCE PERSCS INED 
IGLG SATTIAHEIVhZ^FGMKHDG ZGSSCGHKwTXQQNYGSSrTrcEYQSFFLVCLQSRX 
HKQLFKEVCRELi^LSKS^CJTNSIP 

IDGGWG F.v , SLUGECSF.TCGGir/SS51~-^OTS?APSCX}G^^ PC PL 

GS •^FREXQCADFDNMP FRGKYYtJWK PTTGGGVKPCALNC LAEGYTI7YTERAPAVI DG 
TQCNAE SLDI C INGE CKKVGCUJ I LG S2ARETFCKVCGGGGSTCDA I EGFFNDSLPP.G 
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GY>~~~VYQI PF.j^i I EVREVAKSIOJY IALKS EG CCYY IKGAl' T I ^ iv? RXFITv A3T AFH 
YKRPTDE FES LEALC ?T SEKL rVK-.T-IyC. EQKL/G IF, rKF" ~\T I TRTG5G ~;:\,TVGFTV£ "H 
CPWSEC S ATCAGGKKPTRQ PTQRAKV~.TKh! I LSYALCLLKXLIGI* 1 3CRFASS ^"LAX 



EASE COUNT 837 a 551 c 664 g 794 t 2 others 

crigii; 

1 aaccatccag ^tttctaeat tatggaaatt ttgtggaaga cgttgacctg gattttgag:: 
61 ctcatcatgg cttcaccgga atttcatagt gaccacaggc tttcatacag ttcccaagag 
121 gaattcctca cttatcttga acactaccag ctaactattc caataagggt tgatcaaaat 
161 ggagcatttc tcagctttac tgtgaaaaat gataaacact caaggagaag acgg&gtatg 
241 gaczctattg atccacagca ggzagtatct aagttatttt ttaaacttcc agcctatggc 
3C1 aagcactttc atctaaa=t.t gactctcaac acagattttg tgtccaaaca ttttacagta 
361 gaatattggg ggaaagatgg accccagtgg aaacatgatt ttttagacaa ctgtcantac 
421 acaggatatt tgcaagatca acgtagtaca actaaagrgg ctttaagcaa ctgtgttggg 
481 ttgcarggtg ttattgctac agaagatgaa gagtatttta tcgaacct.tr. aaagaatacc 
541 acagaggatt ccaagcattt tagttatgaa aatggccacc ctcatgttat ttazaaaaag 
6^1 tctgcccttc aacaacgaca tctgtatgat cactctcatt gtggggtttc ggatttcaca 
bbl agaagtggca aaccttggtg gctgaatgac acatccactg tttcttattc a-taccaatt 
721 aacaacacac atatccacca cagacagaag agarcagtga gcattgaacg gtttgcggag 
731 acattggtag tggzragacaa aatgatggtg ggctaccatg gccgcaaaga cattgaacar. 
841 tacattttga gtgtgatgaa tattgttgcc aaactttacc gtgattccag cctaggaaar 
901 gttgtgaata ttatagtggc ccgcttaatt gttctcacag aagatcagcc aaacttggag 
961 ataaaccacc atgcagacaa gtccctcgat agcttctgta aatggcagaa aiccattct: 
1C21 tcczaccaaa gtgatggaaa caczattcca gaaaatggga ttgcccacca cgataatgca 
1031 gttcttatta ctagatatga tatctgcact tataaaaata agccctgtgg aacactgggz 
1141 ttggcctctc tggctggaat gtgtgagcct gaaaggagct ccagcattaa tgaagacatt 
1201 ggcctgggtt cagcttttac cattgcacat gagattgttc acaattttgg tatgaaccat 
1261 gatggaattg gaaattcttg tggacgaaag grcatgaagc agcaaaatta tggcagctc-i 
1221 catta=tgcg aataccaatc ctttttcctg gtctgcttgc agtcgagant acatcaccag 
1231 ctttttagag aagtgtgtag agagctctgg tgtctcagca aaagcaaccg ctgtgtcac? 
1441 nacagtattc cagcagctga ggggacactg tgtcaaactg ggaatattga aaaagggtgg 
1501 tgttatcagc gagattgtgt tccttttggc acttggcccc agagcataga tgggggctgg 
1561 ggtccctggt cactatgggg agagtgcagc aggaciztgcg ggggaggcgt ntcctcatc: 
1621 ccaagacact gtgacagtcc agcaccttca ggaggtggaa aatatcgcct tagggaaagg 
1681 aaacggtatc gctcctgtaa cacagatcca tgccctttgg gttcccgaga ttttcgagag 
1741 aaacagtgtg cagactttga caatatgcct ttccgaggaa agtattataa ctggaaaccc 
1E01 tatacLggag gtggggtaaa accttgtgca ttaaacrgct tggctgaagg ttataatttc 
1E61 tacartgaac gtgctcctgc ggtgatcgat gggacccagt gcaatgcgga ttxactggat 
1521 atctgcatca atggagaatg caagcacgta ggctgtgata atattttggg atctgatgct 
1SS1 agggaagata gatgtcgagt ctgtggaggg ggcggaagca catgtgatgc cattgaaggg 
2C41 ttcttcaatg attcactgcc caggggaggc tacatggaag tggtgcagat accaagaggc 
2101 tctgttcaca ttgaagttag agaagttgcc atgtcaaaga actatattgc tttaaaatct 
;i*>l gaaggagatg attactatat taatggtgrc tggactattg actggcctag gaaatttgat 
22M gttgctggga cagcttttca ttaraagaga ccaa^tgacg aacracaatc cttggaagct 
ctaggtccta cctcagaaaa t"tcatcgtc atcgttcEgc t-zaacaaca gaatttggga 
2341 attaggtata agztcaatgt tzczatcact cgsactgcca gtggacataa tgaagitgg: 
2 4 7) 1 rttacatg^a a teat cage:: ttgstcagaa tgctcagzta cttgtcctgg aggtaagazg 
2461 cccactaggc ag:::ac:ca gagg>:caaga tggagaacaa aacacattc: gagctatget 
2 = 21 zzgzgcttgt taaaaaagct aatt'jgaaac att:::tc:a ggtztget-c aagctgtaat 
2531 ttagcaaaag aaactttget ttaattatat tatattzzat ttgttttcaa cctcatgtaa 
2641 tttgtgcaga tttgttggta aaatacatzt tggcacaatg agtgtctctg ctggtgcttc 
2731 tcccaagact atcttgaagg tgggctgttt gectttegtg aacacattct tggtaaagaa 
2761 catraaaagt tttaaaaaag aaaatgagca agaatcagac aczacagatg caacttcttg 
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ATURES 



Lccaticr./Qualif iers 
1. .3218 



source 



/crganis^= 'Hcmo sapiens' 



CDS 



ger.e 



/chrc.iK!£o~e= ' 15 " 
1. .3213 

'ger.e= -ADAPTS? ■ 
13. .3C03 



/gene^ADAKTS?" 
'note="ZINC KETAUjOPHOTEASE " 
,'codon_start=l 

.'prcduct= ' A Disintegrin-like And Mettllcproteaae domain 
with ThrcrrioSponain type I mDtifs-7 (AEAK-TS7 ) " 
/trans laticn= ■MPGGPSPRSPAPIiRPLUIJX^APGAPGPArc^ 
I^^^A/I^GGSFLSYEl/rtTPALFja^ 

APGFVSSTRRRGGLGRAHIRAfflTACHLLGEVC D PEL3GGLAAI SAC DGLKGVFQL S?: 
EDYFIE PLC SAPARPGHAQPHWYKRQAPERLAQRGDS S APSTCGVQ VYPELES RRER 
WEQRQ2WRRPRIJ*RI^QRSVSKE^^ PQVESYVLT IMNMVAG 

LFHDP5 I GN*P IHITIVRLVLLEDEEEDLK ITHHAENTLSSFCKWQKS INMKGDAHPLH 
KDTAIIiTRm/^AMNTlPCETLGLS HVAGKCQPHRSC SINEDTGLPLAFTVAKELGH 
SPG IQhXGSG^^DCEP^TjKRPFXKSPCI^YCAAPLIWSRCSRQYI TRFLBRGWGLCIXO 
PFAKD I IDF PSVPPGVLWSHaCRI^-^YSAFCEDMDtMnn'l^ SVGTTCHSKLD 
AAVDGTRCGEJIKWC LSGECVP*/GFEPEAVD3GWSGWSAWS ICSRSCGMGVCSAERQCT 
Q PTPKYKGS Y CVGE RKRrRLCNLQAC PAGKP SrRHVQC SHFDAMLyKGQLKW/PVV^ 
^TCELKCRPAl^TYFAKICiRDAV'VDGT PCY37RASRDLC 2JGICKJVGCEFE IESGA 
MEERCG'^CKCa«3STCHT'/SGTFiEAE3LGYVI?^3LI PAGARE IRIQEVAZAANTLALR 
S EL PEKYPLKSGWT 1 AGTTFTYARRGNWENLT S PG PTKEPVWZQVPASRG 
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PGGGSF.GCT^RFSTI^C-RSRP^^ ^ 
E.\SE c:U>n- 554 a 1C41 c -'^ 5 59: r 

"' "^ N : c-sttcctg cca:cccccg cggcccsagt ccrcgcagcc ccgcgccttt gctgcgczez 
£1 ::=c:==tc: tcctctgcgc cctggctecc rgrgcccccg gacccccacc aggacgtgca 
::: acccagggcc gggcggcact ggacatcgtg cacccggttc gagtcgacgc ggggggctcc 
■Rl t-ctgtcct acgagctgtg gccccgcgca ctgcgcaagc ggcacg=atc tgtgcgccga 
241 gacgcgcccg cctt~acga gctacaacac cgcgggcg-g agctgcgccr caacctgacc 
Id gccaatcagc acctgctggc gcccggcttt gtgagcgaga cgcggcggcg cggcggcctg 
3 61 gcccgcgcgc acatccgggc ccacaccccg gcctgccacc tgcttggcga ggtgcaggac 
421 cctgagctcg agggtggcct ggcggccatc agcgcctgcg acggcctgaa aggtg-grtc 
431 cagctctcca acgaggacta c"tcattgag cccctggaca gtgccccggc ccggcctggc 
54i cacgcccagc cccatgtggr gtacaagcgt caggccccgg agaggctggc acagcggggc 
6 21 cattceagtg ccccaaacac crgtggagtg caagtgtacc cagagctgga gtcccgacgg 
661 gagcgttggg agcagcggca gcagCggcgg cggccacggc tgaggcgtct acaccagcgg 
721 tcggtcagca aagagaagtg ggtggacacc ctggcagtag ctgatcccaa aatggtggag 
781 taccaccgac acccgcaggt cgagagctat gtgctgacca tcacgaacat ggtggc-ggc 
341 ctgtttcatg accccagcat -gggaacccc atccacatra ccattgtgcg cctggt;ctg 
= C1 ctggaaga-g i™*™*™* ccraaacacc acccaccatg cagacaacac ectgaagagc 
561 -tctgcaagt ggcagaaaag catcaacatg aacggggacg cccatcccct gcaccargac 
1-21 actgccatcc cgcccaccag aaaggacc-.g cgtgcagcca ^gaaccggcc ctgtgagacc 
10S1 ctgggactgt cccatgtggc gggca-gtgc cagccgcacc gcagctgcag ca-caacgag 
1141 gacacgggcc tgccgctggc cctcactgta gcccacgagc tcgggcacag ctctggca-t 
1231 cagcatgacc gaaccggcaa tgactgtgag cccgttggga aacgac=t-t caccatgtct 
1261 ccacagctcc tgtacgacgc cgctcccctc acctggcccc gc-gcagccg ccagcatatc 
1321 accaggtccc ttgaccgtgg gtggggcctg tgcctggacg ac-ctcctgc caaggaca^ 
-81 atcgacttcc cctcggcgnc acctggcgtc ctctatgetg taagc-acca g^gccgcctc 
1441 cagtacgcgg cccaccccgc cttctgcgag gacacggata atgtctgcca cacactccgg 
l c ,01 tgctctgtgg ggaccacctg tcactccaag ctggatgcag ctgtgjacgg cacccggtgt 
1S61 cgcgagaata agtggtgtcc cagcggggag -gcgtacccg tgggc-tccg gcccgaggcc 
1621 gtggatggtg gctcatctgg crggagcgcc tggtccatct gctcacggag ctgtggcatg 
1651 agcgtacaga gcgccgagcg gcagtgcacg cagcctacgc ccaaa:acaa aggcagatac 

1- 41 tgtgtg-gtg agcgcaagcg cttccgcctc tgcaacctgc aggcctgccc tgctggccgc 
^0" ccctccttcc gc-acgtcca gtgcagccac tccgacgcta tgctctacaa gggccagctg 
1561 cacacatggg tgcccgtggt caatgacgtg aarccctgcg agctgractg ccggcccgcg 
1321 aatgagtact ttgccaagaa gctgcgggac gccgtggtcg atggcacccc c-gctaccag 
^931 g-ccgagcca gccgggacct crgcatcaac ggcatctgta agaacg^ggg ctgtgacttc 
2041 aagattgact ccggtgctat ggaggaccgc tgtggrgtgc gccacggcaa cggctccacc 
2101 ^ccacaccg tgagcaggac cttcgaggag gccgagggtc tggggtatgc ggatg-gggg 
2161 c^atcccag cgggcgcacg cgagatccgc atccaagagg ttgcccaggc tgccaacttc 
2221 ctggcac-gc caagcgagga cccggagaag tacctccrca acggtggc-g gaccatccag 
2251 tggaacgggg actaccaggt ggcagggacc accttcacat acgcacgcag gggcaactgg 
"34 1 aagaacctca cgrccccgcrg tcccaccaag gagcctg^ct ggatccaggt ccctgrctcc 

cccggcccag gcg=qgggag cagaggcgca gtccccaggc ccagcaccct ccatggcagg 
2461 -tc^tcctg gaggagtgag ccc^ggttca gtcacagagc ctggcrctga gccaggccct 

2- 21 cctc = c3C-= cctctacc-c agtttcccca tc^taaaat ggrc"aat::t cgta.ctgca 
2^1 r.tcaca^s ^:.acctgggg -caagcccct ttacgactcg gtggatggag aagacacc-.t 
-;i ; : ,c:c 3 : a = ^ccccg;;: gcccaccrag -gctgttcc agga: 7 agcaa ccc-cgggrg 

L^^rzagt acaccancca caggcaggci ggtgg^cacg acga-jgtccc gccgcccg-g 
._.^: c= - attatgcacc ct^gaccaag tgcacagtca cccccggcag aggtgacaag 
IE21 tgcgccacgc acagccccac ctgcaggggc ttagtgtccg ga=aggga= a crggcttcag 
2=-n ctcccaccrc actgctgggc caccacggg- ttggaagttt gctt:tctga gcctcagttc 
2^41 cccatctgtg a^atgaggct agcgatcgcc ctgtgrccca ggcc-gctgg gagggracat 
^ori crgacgacgca ggtgggtgct ggcrcgcggc gcatgttcag -g-gctccag ctc^ggcgt 
-c^ccc-cca gcg^acaca^ cicccccccg atagaccagt ccagtggccc ctcaccazac 
2121 tgactrattt ccct aaacta tttataaaaa gtagggcaat -tcattaacc ctgaccctta 
3 151 cczgcccggg cggccgctcg agccgagtaa tcactagt 
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Fig. 5A 

10 2C X 40 50 60 70 

tagggcgac tgcacgcgacgccgcggaggacgcgcgc tcgcggcccggggcgccacg^gctcgagt tc tg 70 
ctaggttggctggcgcaggaggagcgggctgcgcgatccagaggggccgccagggaccgccgcgccacgt 140 
gccgctagccgagtcggcctccccacccgattgatcatttttcccggacagagcgacccggccgccccgg 210 
gccaccagcacctgcccgcgcgcggcgatcttcttccctctcccgcgctccgcagcactctgcccccATG .780 
crcnio^axcAXAC^ 350 

360 370 380 390 400 410 450 
''''I i i i i ! i i i i 1 i > i i i m i ' 1 i i i i ! i i i i I i i i n I i i i i 1 i i i i I i i i i I i i i i I i i i i I i i i i I 

T3(3ICTC:SV3CCC^^ 4f0 

G0OC33:^C<X3CGAGQGA^^ 4^0 

03IGA010:A3CTTTX T I^^ 560 

G:XAG30G:<^CTGOGIG3C^ 630 

GAGCTOIXTI^ 700 

710 720 730 740 750 760 770 

1 1 1 1 I 1 1 i t I 1 i t 1 1 1 i 1 1 1 i 1 i 1 I t 1 1 1 I 1 1 1 1 I 1 1 1 t I \ 1 1 t I 1 i 1 1 I 1 1 t 1 I 1 1 1 it 1 1 t 1 I i 1 i 1 I 

G:T3333A0^XXCIG3AC2AC^ 7^5 

g:<TC3:^gccgc^^^ £4~ 

GZ^Cj^&J^^^ SI J 

G^CTOXGIAA^IGCC^ 980 

1060 1070 1080 1090 1100 1110 1120 
: 1 i 1 I f 1 1 1 I 1 1 1 1 I 1 m 1 I 1 1 m I 1 ' 1 1 ! 1 1 1 1 t 1 1 1 1 1 1 1 1 1 I 11 t 1 t 1 1 1 1 1 I 1 1 1 I ii.l ,111! 

o:ip;iAG:<?rcAiGiOA^ 1120 

TGOOO^CTTOTCCA'XmOCAA 1260 
TOO-^JirjnC^:^ 11 ■ c 

'3ITG30AO OAT ?P3TGAC33Ca^2AA:G?GCIGCTC^^ OA 1- r X 

2 41C 1420 1430 1440 1450 1460 1470 

■ 1 l : 1 1 M I 1 I ' l 1 1 1 1 1 1 I 1 1 1 1 i 1 1 i 1 I 1 1 1 1 1 i 1 1 l I 'iiit ill! i L 1 I 1 I L 1 J. \ 

c:er33x^?iA3CT;^33C^:^^ i^ -'O 

TGOG 0O2ATQ 3 0CAA<3IA 2CA^IXl^T3CO:-3C^riCT^ 2C?IGA^^Aa^CXCTGCX T rG JT1T 1 54 C 
OXT32AJ^riGICTAC^ICA>3AGA 0610 
OllX0?3rPX30(XC?a0OGA^ 1630 
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1760 rHO 1760 1790 1800 1810 1S2C 

' ' ' ' f ' ' ' ' 1 ' ' ' 1 ' ' ' ' 1 i l l ■ I i n i I i i i i 1 i ■ i : I M i : I i i i i I M l l I M l t I ' l l , 1 l I l l I 

COXGTTCAiraXA^^ 162C 

CAOXT33T3GCO;TC3GC^^ 139C 

G^CTCTQGTAIIATOGAGAC^^ 195C 

ATACAOTCTO^AACO^ 2030 

GAGIC a AGTACC^TCATGCAA^ 2 10C 

2110 2120 2130 2140 2150 2160 217C 

■ ' ' ' I ' ' » I I ' I I ' I ' ' t l I r i . i t i l l i I i l i : I ! 1 t l I l i t i I i l l i I l 1 l I I 1 I I 1 I I l 1 l I l 1 l I I 
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Fig. 7B (coirt) 
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Fig. 8 (con't) 
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TC7QG3AA3CajODG^^ S40 

ATC^OXAXLiTGCCGGGAA^IXXC^^ S10 

GCCAT33CAAra:X^TTC^^ 980 

CATCT^ICTAC^^ 1C50 

1060 1070 1080 1090 1100 1110 U2C 

i i • i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i 1 t i i i I i i i i I i i i i I i i i i 1 i i i i I i i i i I : i i i I 

AGAG^OTIGCAGC^^ 1120 

CA7IC33CATC^ 1190 

<x<ttg:h^catiacc^ 12 60 

tfx?tj&n7iq33x?7j^ 1330 

t33ca^3ggccaagccik:3at3ca^ 1 4c3 

1410 1420 1430 1440 1450 1460 1470 

! i ■. I I i I i i I i ■ i i ? i i i i I i : i i I I i I I I i i i i I i i ■ i ' t i i i I i ■ < I I I I I I I I I I I t I 1 I I I I I I I I 

T03^T±C<JJVC3CQ^ 1610 

CCXXSkOCTOIt^33G^ 1680 

AAGTACreiCTOGGCT^^ 1750 
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1760 1770 1760 1790 18CC 1810 1820 

t I i I 1 i i I I I r I I t I i i i i I ■ r ■ r 1 ■ , , i I i ■ i i 1 i t l l i l l l i 1 i l l l I i i i ) I . . i r I r > t i I i l l ) I 

ACTITCAGAGAAGT^ 1820 

GTAC033GGAGGGGGCC7I^ 1890 

AGGGCGGCAGCCGIQ^ 1960 

a^GCACGIX3GGC^<X^^ 2030 

TGhCG&CNTTGCCIXtt 2100 

2110 2120 2130 2140 2150 2160 2170 

■ ' > ■ I ' ' ' ' I ' ' i ' 1 ' ' ' ' I ■ ■ ■ ' 1 i * i i I i i i i I i i i i 1 i i i i I i i i i I i i i i I i i t i I i i i i I i i - i j 

(TIXXTieK^TIOOCAAA^^ 2170 

CCCTGAAujGAGaC^^ 2240 

TCTAC<7IU^a^<X^ 2310 

ATTAATGCATCTTCICAl^^ 23S0 

CCC02ATCGCCCGTGACTC^ 2450 

2460 2470 2480 2490 2500 251C 2520 

i i i i I i i i i I t i i i I i i i i 1 i i i i I i i i i I i i i < I i i i i I i i i i I t i i i 1 i i i i I i i i i I i i i i I i i i i 1 

CTolTXAGGCGGTAQCGAGGI^^ 2520 

CA^ACIGCAGTQCC^^ 2590 

ACTGjCJTIGEftGGG^ 2660 

GHI5CCAGCGCCGCGTC7IUIQ^ 2730 

<XTCIACTCGAGQCCrc02 2800 

2810 2820 2830 2840 2850 2860 287C 

i i i i I ; i i i I ; i i i I i i i i I i ■ i i I i i i i I i i i i 1 ■ i i i I i i i i 1 i i i t I t t i i I i i i i I i i i i I i i i i I 

CD2C^AGCmn3GGCCia?ICIC^ 2870 

3CCCCCT03Ga>C^^ 2940 

CCTO2CCG2TOGG7IGACC^ 3010 

:ACT:<33CTCCACCAGCCA^ 3080 

■X^2A3TX^1^3GCCA^ 3153 

3160 3170 3180 3150 3200 321C 3220 

1 1 ' 1 i ' ' ' ' I j ' ' ' I ' ' . i I i i i i I i i ' i I i i i i 1 i | i i ! : i i i I i ' i i I i i i r 1 i M : I : I I i ., 1 1 : 1 

AA<:^GGtfOG:7riACr2OT XTACirirGOCAGA-XTr 32 2 3 

C^2TG^AA^CT3C2AAG3C02Ctagggracc tggaaccaacctggagcacaggctgaggcaggggacat 3293 

c c cac t ggagagggca tgacggaaagcgcggc t tgaa t t gaaggg^gaga t gcagt tgaaagr ta 1 1 ta i 3360 

tgggtaaccctaca^sggctcctgactaaggggtggagaagagctggctacccagggaccctcrgctgta*: 343 3 

cttgcccagrtgatagtgaagagagacgacrccttgttgcacaca^atttaagtccctagcacccctccc 3500 
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Fig. 9A (con't) 

3513 3520 3530 3540 3550 3560 3570 
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accc^ttgaccgga^tatgtactgtgaagagtgcgggrggggaggggtgtgctggtgccctgcccccrgc 3570 

actgrtctatccctacactctgagctgggggcacrcatatctgctatggggggagtaggcttgataccac 3640 

ctccctgtagccctcccccagactgacgaagcggaagatccaccccaacccctgccctgcctgccccagg 3710 

ggggagttcaacatccaggccgtcccccatcatggtgctacaagccctgccctggggcccacacactcct 3780 

caccaagaagccttacattaaaaaagttgtgttatcctacaaaaaa^aaaaaaaactcgagggcgggccc 3650 

3860 3870 3880 3890 3900 3910 3920 
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ggtacccaattcccgctatagtaaatngggtntta 3885 
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SRTPSG^KMSSCF^R^^^ 40 

GAAPFKGGKSKVPPLT^ 80 

SVSYV/TCHSL^-R^^ 120 

GOJ^:^/ADEEm 160 

SSLF^PHTXtfAO?^^ 200 

210 220 230 240 A 

ETERO^PGU^'S^YV^ 240 

YVLAI>£JIVAKIi^3SSLjG^ 230 Wr^.^/,. 

I^IHAGKSLDSFCKW^ 320 

VLITRYDICnTOXFCCni^^ 360 

AATSVf IKCH5ZC^rrFT^H^ 400 

410 420 430 440 

I7>lK"rrJFFWSSa^^ 440 

YPIVAPGQAYIWDBQCR^ 480 

KSNRCITNSIPAAB?I1j3^ 520 

EGVDGA^JGIWF/raXSRT^^ 560 

pafCKSRRI^SCOT^^ 600 

610 620 630 640 

■ ' ■ * i ■ » ' ' i ' ' ' ' i ' ' < ' i ' » ' ' i ' ' ' ' i ' ' ' ' i ' t < i i . 

:-;F/KWr;TYRGGa~<A^ 640 

CRCTVDia/SGEE^^ 680 
ACETIB^^SPASPGAGYEDVV/ttPKGSVHIFIQDUaL^ 720 

SHLi\U<GDQESIJLIj^^ 760 

<^52SL5^PINrtSLIW^ 800 

810 82 C 830 840 

! ' ! t i ' ' ' ' ' ' ' ■ ' - [ ' ' ' ' I ' ' I i I ! ' 1 ' I ' ' ' ■ I ' ' ' ' I 

rriCSAHSKLPXROFA^ £ SO 

vr^SV\\,^RKV£AAEK<^^ 920 

CPP3-RT1J>JSEOT 960 

HCL?AAKP?^Ir.nSILRrc^ 1000 



WO ill 1 HT4 PCT.'l SUM- 21223 

2" 54 

Fig. 9B (corTt) 



1010 1020 1030 1C40 
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GQR^CTSH?32FSREC^^ 1040 
C^?EHIKM^7\YCPLVK<F^ 1060 
R 1081 
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Fig. 10A 
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AGCAGCAGCTGTCGTO^ 40 
ACGGTraGACATTIOICTC^ 80 
GOT3TCACMG3TO^ 120 
AT3XGTOITJ[CT3GGGGI^A^ 160 
ATIGAAQGIGIUiTTAGQCXIftGC^ 2C0 

210 220 230 240 

I I I I I ■ I I I i 1 1 ) 1 I I I I I * I I ' ' 1 ' I > ' ' ' I ' ' ■ 1 1 



ATCAGGACCJIOGTXTC^ 240 

TTTZ^TCCAAGAT^ 230 

CTAAAGGGGGACCAAGAGTC^ 320 

CIGCXACCCCCCAAjC^^ 360 

CACATTICATOTOGGO^^ 400 

410 420 430 440 

* i > i I i i i i I i i i i I i i i i I i i i i I i t i i I ' i i i I i i i i 1 

CT3GAAGOXTaGGACCC^ 440 

TGJIXOTX^GCCCAGGCA^ 430 

CTTCAATGCACXXATIQCCC^ 520 

TOr^^CIATraXOrKGACCAA^ 560 

GfI02£03CXjGC2iG^ 6C0 

510 620 630 640 

■ ■ ■ ' I ' > ■ ' I ■ ' 1 1 1 1 ■ 1 ■ I ■ 1 1 1 1 1 1 1 1 1 ■ ■ ■ 1 t 1 1 ■ ■ I 

TC^OITOCiACAG^ 640 

GGXA2AGTAAATIIXC^ 680 

CA1AA02ATGTCCACCA^ 720 

AOSCTCOX^nTrAiX^ 750 

TG-iJITOIGIGCC^CG'IX^^ 800 

810 82 0 83 0 84: 

i ; i i 1 i ■ ' ' t ' ■ ■ ' I ' ■ ' ■ I ' ' ' ' I ' ' ' ' i ' ' ' ' i ' ' ' ' I 

AAGCCTEAGACGACA^ 840 

TCILC<7IX3GAOXCTCCC^ 830 

TO2GCAA!XCSaSACn^^ 920 
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Fig. J OA (con't) 



1010 1020 1030 1040 

; 1 1 > 1 1 ■ 1 1 I ■ 1 1 1 I 1 1 ■ ■ ■ ■ 1 1 ' 1 1 1 I > * 1 * I 1 1 1 1 I 

GC^GCCaftGO^^^ 1040 

GCTGCCCXCTGCCOX^ 1080 

GTCITCCACACAC^^ 1120 

GTETOGeiGCACCAG^^ 1160 

GCACTGAAGCCTItXXa^^ 1200 

1210 1220 1230 1240 

i i ' ' 1 ' » 1 ' I ■ < 1 ' I 1 ■ 1 ' I ■ ' ■ ' I ' ' i i 1 * 1 1 ■ I 1 1 ■ ■ I 

GGCCAAXXTIGAG&J^^ 1240 

GAAGAATCC^AGGATS^^ 1280 

TGCTD'TTnr^A^T^ £XXn7£TICCC 13 2 0 

CG£GATT7Tt3^^ 1360 

TGGAAOIMCCK3G^^^ 1400 

1410 1420 1430 1440 

' r i i I i t i i 1 i i i i I i i i i I i, i f i 1 i i t i i i i i i I i t i i I 

CACI^ a £A3GGC^^^ 1440 

GGCTPGAGATGCAAG^^ 1450 

TACAGGQCTICIGrtCTW 1520 

CCCC^32GAC07ITITCr^^ 1560 

TC^GAG/^GAGGAC i 'lL'riGGKxMACACAL U T/l AAGDCC 1600 

1610 1620 1530 1640 

' ' ■ ' l ' ■ ' ' I ' ■ ' ' I ' « ' ' 1 ' f i ' I ' i i ' I < ' * ' I ' ' ' ' f 

TTAGAQlCTIXXACQTr^ 1640 
GJ 1642 
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AAAVvTCTTPlRPE^^ 40 
CKTO3CTCSACSriEX^ 80 
FI QDI^C^SLSHIALKGDQSST ,T ,T .^HT ,FCTPQFXHL?LXCT 120 
TraH^PDQAQSLEA^ 160 
Fl^IAKDALPFYSW^^ 200 
210 220 230 240 

QIIjSSAVAKTOSCSISKLP^ 240 

RCSRSa^wTCSRSWCQR^^ 280 

\ tt y L r^y p^'ip pp HH^TT I?v\ T SBCTPSOC p f? J 3H1?VVT CXFA 320 

DQRSTLPPGHO^AAKPPSTl^^ 360 

CSTQCZZGQQQRWR^ 400 

410 420 430 440 
» ' ' ' I ' ' ' ■ I ■ ' ' ' I ' ■ ■ ' I ' ' ' ' I ' * ' ' I ■ ■ 1 ■ I 1 ' 1 1 I 

AKO)SWPPGDGPEEH<TC 440 
C^ITKTOQGR 450 
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Ligated 459225+482392 with Sac I(168)&Eco RI(or Not I) 
Cloning site:5';Eco RI 3'; Not 2 Vector; PT7T3 pac. 

You can put this construct to pcIX^3 . 1 { + ; for trans feet ion 
S'-UTR is 50bp k3 ' -'JTR is 175op 

210-215; in 482392 it's TCCTAC(SY) . 



10 20 30 40 

ill i 1 i i i » I t i i I i ' ' ' I ' ' ' ' t ' ■ ■ 1 I ■ ■ 1 > ! * ■ ■ > t 

gaat tcggcacgaggcagtgtccgat tc tgat t ccggcaa 4C 

cgatccaagcATOGAATOCTQCGoICGQ^ 80 

ACACT3CTOJrCITICIQ^ 120 

C<IACC3CACgctCX^AG^^ 160 

TGCOIGGGGCCCATCG^^ 200 

210 220 230 240 

' ' ' ' 1 ■ ' ' ' I ' ' ' ' 1 ' ■ ' i 1 ■ ' ' ■ I ' ■ ■ ■ I ■ ' ■ f 1 ■ > ' I 

( 2GT3GQGCX?GCC2^ 240 

AGAGCIGIGAAGGAAGA^ 250 

TAAIGIX3GACIGOCCAC^^ 320 

^2AGCAATC<riCAGCICATAA'^ 360 

A3TITEAIG/ViTGGCTIO^ 4C0 

41C 420 430 440 

i l I t 1 i i i i I i i i t I i r i i I i i i ! I i t i i I i i i i 1 i i i t I 

OCC^ r ^ITCAC T 2CAAGTGCCAAGC^ 440 

C riGITCAACTAGCAJDC^ 4S0 

G^ATACAGAATCTITO:^^^ 520 

OIAAATIGIT3GCTC<I^^ 560 

A 3 ^A^TAS£ f TGTra 6J0 

61C 620 630 64C 

1 1 1 1 I t 1 1 1 I 1 1 r 1 I [ I I I I L I ! I I I I ! I I 1 1 1 I I L J 1 i I 

CCI02CaXT3Gria^AGQGGAG^ 640 

e^^XAAATCXXM^ 6-J0 

G^XTTAGACATATT^^ 720 

ACTTATATCTOGAAACTAAAA^ 750 
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Fig. 11A (con't) 



810 323 320 S40 
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Mr^A3TGTIQ3Al7^ 640 

TA22X^AGAA?GGCTCGAC^ 530 

CAAGAIT?OGTAACICGQQ^ 920 

riCATOITCTAl^.CCCAT^^ 960 

CX3GATTICITrariTGaXL 1000 

1010 1020 1030 1040 

i i i i I i i t i I i i i i I i r t i 1 i i i | I i i i i I i r i i 1 1 i i i 1 

TO^GCIGACATUSQ^^ 1040 

OGTOIGCtITCCTC^CA 1080 

AGCTC:wrx2CAAAC^^ 1120 

TOI7IT3TO2AGCCA£^ 1160 

TATa^ICTAC^TCCCXri^^ 1200 

1210 1220 1230 1240 

i i i i 1 i ' ' ' I ' ' ' ' I M ' i I i i i i 1 ' ■ ' ' I ' ' ' ' I ' ■ ' ' I 

cat3gs£xxxx?tgc^^ 1240 

>3AGOX<3QCAa7ITCCI^^ 1280 

CATGTCACITCAG?3GA^^ 1320 

CTAfiGAIOXX^^ 1360 

'IXXTAaJViaSCTQG^^ 1400 

1410 1420 1430 1440 
■ i i i I i i t i 1 i i i i I i i t i I ? i i i I i i i i I i i i i 1 i i i i J 

;CGIGTGGCCAGGGC^^ 1440 

TCGACCAT03AJ3GAATGCACT 1480 

AACAAAGCCCCACATAAAAGAGGAAI^ 1520 

CO^SCTATAMCCOW^^ 1560 

AL?TTGCCAIG?ITCA^ 1600 

1610 1620 1630 1640 

J-J..I 1 1.1 1X1 1 1 J—i-L 1 1.111 1 Mill Mill I I I I I M 1 lJ 

XOT3CTlCT:7ICAGA3^^ >-4C 

c^^rrgttctaratttgaaactt-tgrttaaagaaagca 168C 

gtgtcicactggt:tgtagcr.ttcargggt:t:ctgaactaag 1720 

tgtaaicatctcaccaaagctttttggctctcaaattaaa 1760 

gattgattagtttcaaaaflaaaaaaqaflaaaagatgcggc 1300 
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Ligated 459225+482392 with Sac I(168)6£co RI(or Not I) 
Cloning site:5';Eco RI 3 1 ;Not I Vector; PT7T3 pac. 
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FXSBCSRTCGC<iAANSLPP^^ 30 
C?PEAGDFPA£QCSAHNE~V^ 120 

ixcQKtsrnj^^ ieo 

GO^QLGSTV?S[lTOVCWG^ 2C0 
210 22C 230 240 

i i i i I i i i i I i i i i I i i i i 1 i r ] i I r t i i 1 i i i t I i t i • I 



SCtfT^AIPYGSRHIRLVTr^ 240 

LSSTCITO^SSVDF^rTP 230 

NSGSAD^^FITOFIIKH^ 320 

SABCYDIJ^JFfv^'AI^ 360 (C ) V V f>£. N 1 K r K r/< L ^ c 

ASDGYKQZMFiTDLY:^^ 400 

41C 420 43C 440 
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Kr!ARY7r^P:XFGS\^iEm^ 24C 
LTTa^SIZVSI^AATIIi^^ :SPZ3SCAVIE2D 4C 0 



3LH^FI\^jHEIOiUJ3LS^ 4 g 0 

-CTiXGI^xU KPCES^gl^^i SILTSIZftSKHCTJCTSAI^TEFL^ro^JTr .7 .HT ,P^m 

^ Dis *_ * * * « * * 

YifSTSSHGN^SVJGF/^CSR^^ JSC^VCIGKFJCTRSCS^^ 640 

* * * * * * 

SDAKS3VKTFVSWFKY^ 1 3SKLQYEK 

* * *| — ► Spacer domain 

0^O3G^SSCIXi:GTFrcKKS^ ID 8C j 

EXJ^flSYSG^raEIHS^ tf^3?HSI£LCWV S60 

* * * * * * 

Fig. 13 
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rq^ysi^^ttt^to:^^ -:adk 320 

★ * * * * 

sl:sfcki-jcks;^shcisix^ 4 0 0 

* * * I— ►Dis + * 



I=£EZVr?«rai^ 480 
* * * * * * * ★ 

7^PC PuGSRDFPK<QC^DFIIOI?F^ -CEC 640 

* * *| — ^Spacer domain 



* ★ * —A + 

VgBCSATCaj3GKM?TFQP^^ 860 

c 

I-IFOGPSF^PAPLIilPIIJLlir^ S 0 

* * 

i-yeijqyrgf^ct^t;^ i 6 0 

★ 



* * * f->Dis * 

SQIXS^nKRPr S?QULYI^^ 430 

* * * * * * * 

I ; * * * * J — Spacer domain 

t \K?Lm^V\03TPC^'C , ^^£^LC^GICKI FA 720 

GJ&E ZRICEVTSAMJFLJ^SECF^^ 3 0 0 

C<^GGCTPSILB3 ; iHPGGVSPGS\^^ 330 

* * * * 

C/7-.T7^1Z T .TZF£EICrSir^-^^ZA^^7-^:v , H0 937 



Fig. 13 (con't) 
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Fig. 13 (con't) 
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FIGURE 16 



Pa 



I-'IS3CrV-r;P^'IR3?S?P^JITiaiaVPSRHLLP 4C 

^/EOTira3T^^ 120 
(X-^HGLIVADEE^^ 160 
SSL^HK^IXTACX^TOH^F^^ 200 
EIH^FGIJ<RSVSI^YVE^^ 240 
Tv^AJMNIVAKLfQDSS^^ 280 
ITHhAGT^LDSrcKTOKS IVI\H3GKGNAI PEH3VMmHDTA 320 
VLITRYDICIYKNKPGGTri^^ 360 
AATSVHHCHEIGKTFXjMNHEXjV^ 400 
im*^IPF\AVS3CNFUOT^ 440 
YPTvT^GQAYDAI BQO^HGVKSRQCKYGEE^Sm^CLS 480 
KuS^FCITNS I PAAEraXjCQIHTIDI^^ 520 
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Arc Glv Ser Trp Ala Glu Tyr Glu Arg lie Tyr Gly Asp 31/ Ser Ser 

175 130 155 

10 cgc ate ctg cat gtc tac aac cgc gag ggc tct age ttc gag gcc ctg 

Arg lie Leu Kis Val Tyr Asn Arg Glu Gly Phe Ser Phe Glu Ala Leu 

190 195 20C 

ccg cca cgc gcc agt tgc gag act cct gca tec cca tct gcg ccc caa 

15 Pro Pro Arg Ala Ser Cys Glu Thr Pro Ala Ser Pro Ser 31/ Fro Gin 

2 0 5 2 10 215 

gag age ccc teg gtg cac agt aga tct agg aga cgc era gcg ctg gcc 

ijlu Ser tzo Ser Val Hit Str: Aiy Sex Aiy Aiy A.iy Ser Ala Leu Ala 

20 22C 225 23C 235 

ccg cag ctg ctg gac cac tea get ttc teg cca tct ggg aac gcg gga 

Pro Gin Leu Leu Asp His Ser Ala Phe Ser Pro Ser Gly Asn Ala Gly 

24 3 24 5 25 0 

25 

cct cag act tgg tea agg cgt agg cgc cct tec ate tec age gcc cgc 

Pro Gin Thr Trp Trp Arg Arg Arg Arc Arg Ser lie Ser Arg Ala Arg 

255 25C 265 

30 cag gtg gag etc etc ttg gtg get gac tec tec atg gee agg atg tat 

Gin Val Glu Leu leu Leu Val Ala Asp Ser Ser Met Ala Arg Met Tyr 

270 275 260 

egg egg ggc ctg cag cat tac ctg ctg acc atg gee tec ate gcc aac 

3 5 Gly Arg Gly Leu Gin His Tyr leu Leu Thr xet Ala Ser lie Ala Asn 

255 290 295 

agg ctg tac agt cat gca age att gag aac cac ate cge ctg gcg gtg 

Arg Leu Tyr Ser His Ala Ser He Glu Asn His lie Arg leu Ala Val 

403 00 3C5 310 215 

gtg aag gtg gtg gtg ctg acg gac aag gac acg agt ctg gag gtg age 

Val Ivs Val Val Val leu Thr Aso Lys Asp Thr Ser Leu Glu Val Ser 

320 315 330 

: , gee - . : ' J arc c t ■.; ^a: :.tt : ■ tgg 1 = lc - : " 

Ivs A en Ala Ala Thr Thr leu Ivs Asn Phe lys Lys Trp Gin His Gin 

ii: 345 

I ' cat aar rag rta gcg car gat car csa cac car "a- gar cca g~r at - 1 1 ~ 

His Asn Gin leu Gly Asp Asp His Glu Glu His Tyr Asp Ala Ala lie 

350 355 3S0 



770 



E.1E 



866 



914 



962 



id-; 
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He Glv His leu Leu Gly leu Ser His Asp Asp Ser Lys Phe Cys Glu 

415 420 425 

gag aac etc ggt act aea gaa gac aag cgt eta atg tct tea ate etc 

5 Glu Asr. Phe Gly Thr Thr Glu Asp Lys Arg Leu Met Ser Ser He Leu 
.330 ' 435 440 

acc age ate gat gca tec aag ccc tgg tec aaa tgc acg tea gee ace 

Thr Ser He Asp Ala Ser Lys Pro Trp Ser Lys Cys Thr Ser Ala Thr 
10 445 450 455 

ate aca gaa ttc ctg gat gat ggt cat ggt aat tgt ttg eta gac eta 

He Thr Glu Phe leu Asp Asp Gly His Gly Asr. Cys leu Leu Asp leu 

460 465 47C 475 

15 

cca egg aag cag at- ttg ggt ccc gag gaa cte cca gga cag ace tac 149C 

Pro Arg Lvs Gin He Leu Gly Pro Glu Glu Leu Pro Gly Gin Thr Tyr 

460 435 49C 

20 gat gee acc cag cag tgc aac ttg aea ttc ggg cct gag tac teg gtg 153& 

Asp Ala Thr Gin Gin Cys Asn leu Thr Fhe Gly Pro Glu Tyr Ser Val 

495 ' 530 535 



tgc cct ggc atg gat gtc tgt gcg egg erg tgg tgt get gtg gtg cgc 

2 5 Cys Pre Glv Met Asp Val Cys Ala Arg Leu Trp Cys Ala Val Val Arg 

510 515 520 

caa ggc caa atg gtg tgt ctg acc aag aag ctg ccg get gtg gag gge 

Gin Gly Gin Met Val Cys Leu Thr Lys lys Leu Pro Ala Val 3lu Gly 

30 525 ' 530 535 

act ccc tgt ggg aag gga aga gtc tgc ctt caa ggc aaa tgt = tg gac 

Thr Pre Cys Gly Lys Gly Arg Val Cys leu Gin Gly lys Cys Val Asp 

54 0 * 54 5 5 50 555 

35 

aaa act aag aaa aaa tat tac teg aea tea age cat gga aat tgg ggg 

lys Thr Lvs lys lvs Tyr Tyr Ser Thr Ser Ser His Gly Asn Trp Gly 

56 C ' 5 65 5 70 

4C tec tgg ggc ccc tgg ggt cag tgt tct cgc tea tgc ggg gga gga gtg 

Ser Trp Gly Pre Trp Gly Gin Cys Ser Arg Ser Cys Gly Gly Gly Val 

575 580 585 

eac ttt gee tac cgc cat tgt aat aac cct gea cct cga aac agt ggc 

45 Gin Phe Ala Tyr Arg His Cys Asn Asn Pre Ala Pre Arg Asn Ser Gly 

59: ' 5?I HO 

cgc t a c tgc aci ggg aag agg gee at a t a e cgt tec tgc a 2 '- ? - - 

Arc Tv i- Cv= Thr Gly Lvs Arg Ala He Tyr Arg Ser Cys Ser Val Thr 

H " €15 * n: 



ccc tgc cca ccc aat ggt aaa tct ttt cgc cat cag cag tgt gaa gee 

Fro CVs Pre Pro Asn Gly lys Ser Fhe Arg His Glu Gin Cys Glu Ala 

621 625 63 C 6 35 

5 5 

aaa aat ggc -at rag ter gat gca aaa gga gtc aaa aea ttt gta gaa 

lvs Asn Glv Tvr Gin Ser Asp Ala Lys Gly val lys Tnr Phe '.'a: Glu 

■ ■ ; - '4 - 



: 3 4 e 



1394 



144 2 



1566 



1534 



16c2 



173 C 



1778 



1526 



1922 
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gc: acg gat: ggg a:: gaa cgc agg ccg cac age aac ect gcg eg: gtc 2114 

Val Thr Asp Gly Thr Glu Cys Arg Pre Tyr Ser Asr. Ser Val Cys Val 
e 6 5 5 5j 695 

cga gga egg tgc gtg aga act gga tgt gac ggc ate att ggc tea asg 2162 

Arg Gly Arg Cys Val Arg Thr Gly Cys Asp Gly He He Gly Ser Lys 

7CD ' 705 7IC 715 

10 eta caa tat gac aag tgt gga gtg tgc gga ggg gat aac tec agt tgt 2210 

Leu Gin Tyr Asp Lyg Cys Gly Val Cys Gly Gly Asp Asn Ser Ser Cys 

720 725 730 

aca aag att ate gga aec ccc aac aaa aaa age aag ggt tat act gae 2253 

15 Thr Lys He He Gly Thr Phe Asn Lys Lys Ser Lys Gly Tyr Thr Asp 

735 "?43 745 

gtt gtg agg ate ect caa gga gca ace cac aca aaa etc cga cag tte 233b 

Val VH Aiy IH Glu Gly Ala T.^ His He LyS Val Arg Glu Phe 

20 750 755 76: 

aaa gee aaa gac cag act aga ttc ect gec cac tta gec ctg aag aag 235-* 

Lys Ala Lys Asp Gin Thr Arg Fhe Pro Ala Tyr Leu Ala Leu Lys Lys 
765 77C 775 

25 

aaa act ggc gag cac ctt acc aac gge aag cac atg att tec act tea 2432 

Lys Thr Gly Glu Tyr Leu lie Asn Gly Lys Tyr Met He Ser Thr Ser 

780 765 790 735 

3G gag aec ate ate gac ate aac ggt acc etc atg aac tac age gga tgg 2450 

Glu Thr He He Asp He Asn Gly Thr Val .Vet Asn Tyr Ser Gly Trp 

800 SC5 aic 

age cac aga gat gat tct tta cat ggg atg ggc tat tea gec aca aaa 2i9S 

35 Ser His Arg A3p Asp Phe Leu His Gly Met Gly Tyr Ser Ala Thr Lys 

815 623 825 

gaa ate ctg ate gcg cag ate etc gee aca gac cea act aaa gcg eta 2546 

Glu He Leu lie Val Gin He Leu Ala Thr Asp Pre Thr Lys Ala Leu 

40 330 335 S40 

ggc gte cgc cac age ttc ttt gtt ccc aag aag aec acc caa aaa gta 2 5*4 

Gly Val Arg Tyr Ser Phe Phe Val Pro Lys Lys Thr Thr Glr. Lys Val 

£ 4 5 8 5 C 555 

45 

aac : ct gr c aac ag e cat ggc age aa: aag gcg gga era :?.: tct aca 2 •' 4 2 

As:. Ser Val He Ser K:s Gly Ser Asn Lys Val Gly Pre His Ser Thr 

3*3- i--: : f. 7 

" ' rag _ tg rag cg^ ::tg ara cgr ego Hg q~c tac t:: igg ac: rgt 2-'-' 

Gin Leu Gin Trp Val Thr Gly Pro Trp Leu Ala Cys Ser Arc Tr.r Cys 

e 3 o ess 5. ? :• 

gac aca ggc tgg cac act agg acc gcg cag tgc cag gac gga aac agg 2HS 

5 5 Asp Tnr Gly Trp His Thr Arg Thr Val Glr. Cys Gin Asp Gly Asn Arc 

3H " 5H' H5 

aa 1 t a a a a :-*:a t "■ r ct t ~ t c c rc z-\~ ect ttt - c 3 ttt aac 2~H 
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agt:a:gc:: :ca:g:cot: g;ca::ac=a c — gaa^tat: ggg:daaa:c :g:tc:ct^ 2S£: 
gacraaaagg ::ta;:ctac ttggtgaarg acggcaccrgt: ga 33C2 



<21C> 2 

<212> PRT 

10 <213> mus musculus ADAJ-1TS-5 

<4CC> 2 





Met 


Arg 


Leu 


Glu 


Trp 


Ala 


Ser 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 




1 








5 










10 










15 




15 


Ala 


Ser 


Cys 


Leu 


Ser 


Leu 


Ala 


Ala 


Asp 


Ser 


Pro 


Ala 


Ala 


Ala 


Pro 


Ala 










20 










25 










30 








Gin 


Asp 


Lys 


Thr 


Arg 


Gin 


Pre 


Gin 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Glu 


Pre 








35 










4C 










45 










Asp 


l; In 


Fro 


Gin 


Gly 


Glu 


Glu 


Tin 


Aig 


Glu 


Aj_ y 


Gly 


Hib 


L=u 


Gin 


ri" O 


20 




50 










55 










CC 












leu 


Ala 


Gly 


■31 n 


Arg 


Arg 


Ser 


Gly 


Gly 


Leu 


Val 


His 


Asn 


He 


Asp 


Gin 




65 










70 










75 










SO 




Leu 


Tyr 


Ser 


31y 


Gly 


Gly 


Lys 


val 


Gly 


Tyr 


Leu 


Val 


Tyr 


Ala 


Gly 


Gly 






















^ -J 










9 5 




25 


Arg 


Arg 


Fhe 


Leu 


Leu 


Asp 


Leu 


Glu 


Arg 


Asp 


Asp 


Thr 


Val 


Gly 


Ala 


Ala 
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110 








Gly 


Ser 


He 


7a 1 


Thr 


Ala 


Gly 


Gly 


Gly 


Leu 


Ser 


Ala 


Ser 


Ser 


Gly 


His 








115 










12c 










125 










Arg 


Gly 


His 


:ys 


Phe 


Tyr 


Arg 


Gly 


Thr 


Val 


Asp 


Gly 


3er 


Pro 


Arg 


Ser 


30 




130 










135 










14C 












leu 


Ala 


Val 


Phe 


Asp 


Leu 


Cys 


Gly 


Gly 


Leu 


Asp 


Gly 


Phe 


Phe 


Ala 


Val 




145 










150 










155 










160 




Lys 


His 


Ala 


Arg 


Tyr 


Thr 


Leu 


Lys 


Fro 


Leu 


Leu 


Arg 


Gly 


Ser 


Trp 


Ala 












165 










170 










175 




35 


Glu 


Tyr 


Glu 


Arg 


He 


Tyr 


Gly 


Asp 


Gly 


Ser 


Ser 


Arg 


lie 


Leu 


His 


Val 










ISO 










155 










190 








-vr 


Asn 


Arg 


Glu 


Gly 


Phe 


Ser 


Phe 


Glu 


Ala 


Leu 


Pre. 


Pre 


Arg 


Ala 


Ser 








195 










20C 










205 










Cys 


Glu 


Thr 


Fro 


Ala 


Ser 


Pro 


Ser 


Gly 


Fro 


Gin 


Glu 


Ser 


Pro 


Ser 


Val 


40 




210 










215 










22C 












His 


Ser 


Arg 


Ser 


Arg 


Arg 


Arg 


Ser 


Ala 


Leu 


Ala 


Pro 


Gin 


Leu 


Leu 


Asp 




215 










23 0 










235 










24 0 




His 


Ser 


Ala 


Phe 


Ser 


Pro 


Ser 


Gly 


Asn 


Ala 


Gly 


Pro 


Gin 


Thr 


Trp 


Trp 












24 5 










2 50 










255 




45 


Arg 


Arg 


Arg 


Arg 


Arg 


Sei- 


Tie 


Ser 


Arg 


Ala 


Arg 


Gin 


Val 


Glu 


Leu 












:ec 




















2^0 








L e u 


Val 


Ala 


\sr 


S-r 


Ser 


Met 


A 1 a 

zee 


Arg 


y •■ < - 


Tyr 


Gly 


Arg 

2 z E 


G 1 y 


Leu 






H 1 £ 




Leu 


Leu 




y.-z 1 


Al a 

- 


Ser 


He 


Ala 


Asm 


A:g 


Leu 


-y- 


Ser 


Hn 




Ala 


Ser 


lie 


:-iu 


Asn 


His 


He 


Arg 


leu 


Ala 


Val 


Val 


Lys 


Va 1 


Val 


Val 




305 










310 










3 15 










320 




Leu 


Thr 


Asp 


Lys 


Asp 


Thr 


Ser 


Leu 


Glu 


Val 


Ser 


Ly s 


Asn 


Ala 


Ala 


Thr 
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3 3 0 










3 3 5 






Th r 


Leu 


Lys 


Asn 


Pr.e 


Cy- 


Lys 


Trp 


Gin 


i:is 


Gin 


Kis 


Asn 


Gin 


Leu 


^-"^ j 




















34 = 










3 5H 








Asp 


Asp 


His 


Glu 


Glu 


ril 3 


Tyr 


Asp 


AL a 


Ala 


He 


Ls u 


r-he 


T h r 


Arg 
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Thr Glu Asp Lys Arc Leu "et Ser 5er lie Leu Thr Ser He Asp Ala 

■3 35 44C 445 

Ser Lys Pro Trp Ser Lys Cys Thr Ser Ala Thr He Thr Glu Phe leu 
450 455 460 

5 Asp Asp 31y .-iis Gly Asn Cys Leu Leu Asp Leu Pro Arg Lys 3-n lie 
465 470 4~5 43C 

Leu Gly Pre Glu Glu Leu Fro Gly Gin Thr Tyr Asp Ala Thr 31 r. Gin 

455 490 ' * 495 

Cys Asn Leu Thr Phe Gly Fro Glu Tyr Ser Val Cys Fro Gly Ket Asp 
10 500 505 510 

Val Cys Ala Arg Leu Trp Cys Ala Val Val Arg Gin Gly Gin Met Val 

515 52C 525 

Tys Leu Thr Lys Lys Leu Fro Ala Val Glu Gly Thr Frc Cy3 Gly Lys 
53: 535 540 

15 Gly Arg Val lys Leu Gin Gly Lys Cys Val Asp Lys Thr Lys Lys Lys 
54 5 551 55 5 56 0 

Tyr Tyr Ser Thr Ser Ser His Gly Asn Trp Gly Ser Trp Gly Fro Trp 

565 57C 575 

Gly Gin Cys Ger Arg Ger Cys Gly Gly Gly Val Gin Phe Ala Tyr Arg 
20 =90 535 590 

His Cys Asn Asn Pro Ala Fro Arg Asn Ser Gly Arg Tyr Cys Thr Gly 

535 600 605 

Lys Arg Ala He Tyr Arg Ser Cys Ser Val Thr Fro Cys Pro Pro Asn 
510 615 62 3 

2 5 Gly Lys Ser Phe Arg His Glu Gin Cys Glu Ala Lys Asn Gly Tyr Gin 

625 " 630 635 ' 640 

Ser Asp Ala Lys Gly Val Lys Thr Phe Val Glu Trp Val Pro Lys Tyr 

64 5 550 655 

Ala Gly Val Leu Pro Ala Asp Val Cys Lys Leu Thr Cys Arg Ala Lys 
30 660 665 670 

Gly Thr Gly Tyr Tyr Val Val Phe Ser Fro Lys Val Thr Asp Gly Thr 

575 680 685 

Glu Cys Arg Pro Tyr Ser Asn Ser Val Cys Val Arg Gly Arg Cys Val 
6 5-0 6 35 700 

3 5 Arg Thr Gly Cys Asp Gly He He Gly Ser Lys Leu Gin Tyr Asp Lys 

705 710 715 720 

Cys Gly Val Cys Gly Gly Asp Asn Ser Ser Cys Thr Lys He He Gly 

725 750 735 

Thr Phe Asn Lys Lys Ser Lys Gly Tyr Thr Asp Val Val Arg He Pro 

4 0 74 0 ~4 5 75 0 

Glu Gly Ala Thr H:s He Lys Val Arc Gin Fhe Lys Ala Lys Asp Gin 

755 760 765 

Thr Arg Fhe Pro Ala Tyr Leu Ala Leu Lys Lys Lys Thr Gly Glu Tyr 

"70 775 780 

4 5 Leu He Asn Gly Lys Tyr Net He Ser Thr Ser Glu Thr He He Asp 
H5 ~SC HH =-C 

He Asr. Gly Tr. r Val Met Asn Tyr Ser Gly Trp Ser His Arg Asp Asp 

e:5 5io a : s 

Phe Leu His Gly Ms :. Gly Ty: Ser Ala Th: Lys Glu He Leu He Va 1 
El n :■ HE n : 

Gin He Lou Ala Thr Asp Pre Thr Lys Ala Leu Gly Val Arg Tyr Ser 

335 S40 c45 

Phe Phe Val Pro Lys Lys Thr Thr Gin Lys Val Asn Ser Val He Ser 

5 5: 555 550 

55 His Gly Ser Asn Lys Val Gly Prr His Ser Thr Gin Leu Gin Trp Val 
- C 5 5 7 C 3 " r SBC 

Thr Gly Pro Tip Leu Ala Cys Ser Ai g Thr lys Asp Tr.r Gly Trp His 
r r-- r? r . »=■? 
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< 2 1 C > 5 
<211> 152C 
<212> DN'A 
5 <213> heme sapiens ADAKT3-5 



<4 3 3 
gg.acat tcac 
aeagaagata 

10 tecaaatgea 
ctggacctac 
gccacccagc 
gtccgtgctc 
gagegcagtt 

15 gcacagggaa 
aatcactccg 
tcaaaa:ttt 
agctgacctg 
atggcactga 

2 0 ctggctgtga 
gagacaactc 
ctgacgtggt 
aagaccagac 
tcaatggaaa 

25 tgnactatag 
cgaaggaaat 
gttatagctt 
gcagcaataa 
cctgctctag 

30 aceggaagtt 
tgt tgaagaa 
cancacccgc 
taatgacat t 



ctggcctctc 
agegcttaat 
cttcagccac 
cacgaaagca 
agtgcaacct 
gcctgtggtg 
tgcctatcgt 
gagggecate 
t catgaacag 
tgtggaatgg 
cagagecaag 
atgtaggccg 
eggcatcatt 
cagccgtaca 
caggattcct 
tagat tcact 
gtacatgatc 
cggttggagc 
tctaatagtg 
ttttcttccc 
agtgggatca 
gacctgtgac 
agcaaaagga 
atgt tagc:: 
gca&tcr.r.g r 
gtcaaccagg 



ccatgacgac 
gc cttcca:c 
catcacagaa 
gatcctgggc 
gaeatteggg 
tgctgtggta 
:ac:gtaa:a 
taccactcct 
tgtgaggcca 
gttcccaaat 
ggcactggct 
tacagtaatt 
ggctcaaagc 
aagattgttg 
gaaggggcaa 
gectatttag 
tccactccag 
cacagggatg 
cagat tcttg 
aagaagecca 
cacactccgc 
acacgttggc 
tgtcctctct 
gtgggttatg 
tgaacaggaa 



tccaaattct 
ct taccagca 
ttcctggatg 
cergaagaac 
cctgagtact 
cg:cagggce 
acect gctcc 
gcagt :tcat 
aaaat ggc ta 
atgcaggtgt 
actatgtggt 
ccgtctgc gt 
tgcagtat ga 
gaacctt t aa 
cccacitaaa 
ccctgaaaaa 
agacc atcat 
acttcrtg :a 
caacagae rc 
ct::caaaa3t 
ageegcagtg 
acacc aga^c 
cccaaaggcc 
atettatege 
cg;:taccc:a 



gtgaagagac 
ttgatgeate 
atggccatgg 
tcccaggaca 
ccgtgtgtcc 
agatggtctg 
cagaaacaac 
gccctgccca 
tcagtctgat 
cctgccagcg 
attttctcca 
c^gggggaag 

caagtgcgga 
taagaaaagt 
agtccgacag 
gaaaaacggt 
tgacatcaat 
tggcatgggc 
cactaaacca 
a aactctgtc 
ggtcaeggge 
ggcgcagtgc 
ttctccgtt t 
acaaaagata 
aegcacagaa 



ctttggttce 
taagccctgg 
taactgtttg 
gacctacgat 
cgccangga: 
tctgaccaag 
ggacgctacc 
cccaatggta 
gcaaaaggag 
gatgtgtgca 
aaggtgaccg 
tgtgtgagaa 
gtatgtggag 
aagggttaca 
ttcaaageca 
gagtacetta 
ggaacagtca 
tactctgcca 
ttagatgtcc 
actagt catg 
ccacggctcg 
caggatggaa 
aagcaatgc: 
ctggaggat t 
agtcatgett 



6C 
12C 
18C 
24C 

3CC 
3 6:: 

4BU 

5 4 0 

6 00 

6 60 
72U 

7 EU 
84 U 
9CC 
96 U 
1C20 
1C80 

:i4 0 

1260 
1320 
13 30 
14^0 
1530 
1520 
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<22C > 

<221> CDS 

<222 > (22 ? . . !2c CI ) 



45 



r.i sc_: 

1 Z ? 



eat.; re 



< 4 : : r. 4 



..ta^aL i at 3 J-:; ji: ttj tgg aag a_-g 1 1 j a-. tug 
r-:et Glu He Leu Trp Lys Thr Leu Tr.r Trp 
1 5 10 



att ttg age etc ate atg get tea teg gaa ttt cat agt gae cae agg 99 
55 lie Leu Ser leu He Vet Ala Ser Ser -31u ?he His Ser Asp His Arc 



r 1 - 
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Fhe Thr Val Lys A = r. Asp Lys His Ser Arg Arc Arg Arg Ser Xe: Asp 

e: 6 5 "to 

cc: at- gat ::a cag cag gca gta t;: aag tta tct ttt aaa tea 251 

5 Pro He Asp Pre Gin Glr. Ala Val Ser Lys Leu Phe Phe Lys Leu Ser 

7 5 8C B5 SO 

gec tat ggc aag cac ttt cat eta aac ttg act etc aac aca gat ttt 339 

Ala Tyr GLy Lys Kis Fhe His Leu Asn Leu Thr Leu Asn Thr Asp Phe 

10 95 100 105 

gtg tec aaa cat ttt aca gta gaa tat tgg ggg aaa gat gga ccc cag 337 

Val Ser Lys His Phe Thr Val Glu Tyr Trp Gly Lys Asp Gly Fro Gin 

::o us 120 

15 

tgg aaa cat gat ttt tta gac aac tgt cat tac aca gga tat ttg caa 435 

Trp Lys His Asp Phe Leu Asp Asn Cys His Tyr Thr Gly Tyr Leu Gin 

12 5 13 C 135 

2C gat caa cgt agt aca act aaa gtg get tta age aac tgt gtt ggg ttg 433 

Asp Gin Arg Ser Thr Thr Lys Val Ala Leu Ser Asn Cys Val Gly Leu 

140 145 153 

cat ggt gtt att get aca gaa gat gaa gag tat etc ate gaa cct tta 531 

25 His Gly Val He Ala Thr Glu Asp Glu Glu Tyr Phe lie Glu Pro Leu 

155 160 ' 165 170 

aag aat acc aca gag gat tec aag eat ttt agt tat gaa aac ggc cac 579 

Lys Asn Thr Thr Glu Asp Ser Lys His Phe Ser Tyr Glu Asr. Gly His 

3C 175 180 135 

cct cat gtt att tac aaa aag tct :cc ctt caa :aa eg a cat ctg tat £27 

Fro His Val He Tyr Lys Lys Ser Ala Leu Gin Gin Arg His Leu Tyr 

190 * ' 195 2C0 

35 

gat cac tct cat tgt ggg gtt teg gat ttc aca aga agt ggc aaa cct $~>5 

Asp His Ser His Cys Gly Val Ser Asp Fhe Thr Arg Ser Gly Lys Pro 

205 210 215 

40 tgg tgg ctg aat gac act cca ctg ttt ctt att :ac tac caa att aac ''23 

Trp Trp Leu Asn Asp Thr Pro Leu Phe Leu He His Tyr Glr. He Asn 

"2 20 * 22 5 23 0 

aac aca cat ate cac cac aga cag aac aga tea gtg age att gaa egg 

4 5 Asr. Tr.r Hi 6 He His His Arg Gin lys Arg Ser Val Ser He Glu Arg 

2 1- E 2 4 2 145 2 5 0 

ill gtg g c g 3l\. ttg et j gtg gca ; a j aaa ati ,tg jtg ggc tar cat 

The Val Glu Thr Leu Val Val Al= Asp Lys y.et vet Val Gly Tyr His 

3 : : : • i f 2 ■: 5 

gge cge aaa gac att gaa cat tac att ttg agt gtg utg aat att gtt 357 

Gly Arg Lys Asp He Glu His Tyr He Leu Ser Val Vet Asn He Val 

2H 2^5 2SC 

55 

gec aaa ctt tae cgt gat tec age eta gga aac gtt gtg aat att at a 915 

Ala Lys Lei. Tyr Arg Asp Ser Ser Leu Gly Asn Val Val Asn He Ho 
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9 



tee at: etc tee :ac caa ag: ga: gga aac arc a:t :ca gaa aat ggg 135? 

Ser He Leu Ser His Gin £er Asp Gly Asn Thr He ?ro Glu Asn Gly 

3 3 5 3 4 0 3 4 5 

5 

a:: gee cac cac gat aat gca gtt ctt at: act aga tat cat ate tgc 11C7 

He Ala Hi 3 His Asp Asn Ala 7a 1 Leu He Thr Arc Tyr Asp He Gys 

35C 355 350 

10 act tat aaa aat aag ccc tgt gga aca ctg ggc ttg gec tct gtg get 1155 

Thr Tyr Lys Asn Lys Fro Cys Gly Thr Leu Gly Leu Ala Ser Val Ala 

365 370 375 

gga atg tgt gag cct gaa agg age tgc age att aat gaa gac att ggc 1203 

15 Gly Ye: Cys Glu Pre Glu Arc Ser Cys Ser He Asr Glu Asp He Gly 

3SC 355 35C 

ctg ggt tea get ttt acc att gca cat gag att get cac aat ttt ggc 1251 

_eu Gly Ser Aid Phe Thr lit Ala Hit Glu He ','dl His. As>;i F.ie Gly 

20 3 95 4 30 4 05 4 13 

atg aac cat gat gga att gga aat tct tgt gga cga aag gtc atg aag 129$ 

Met Asr. Kis Asp Gly He Gly Asn Ser lys Gly Arg Lys Val Met Lys 

415 42C 425 

25 

cag caa aac tat ggc age tea cat tac tgc gaa tac caa tec ttt ttc 1347 

Gin Gin Asn Tyr Gly Ser Ser His Tyr Cys Glu Tyr Gin Ser Fhe Phe 

430 435 440 

3 0 ctg gec tgc ttg cag teg aga nta cat cac cag etc ttt aga gaa gtg 1335 

Leu Val Cys Leu Gin Ser Arg Xaa His His Gin Leu Phe Arg Glu Val 

445 450 455 

tgt aga gag etc egg tgt etc ace aaa age aac egc tgt gee acc aac 1443 

3 5 Cys Arg Glu Leu Trp Cys Leu Ser Lys Ser Asr. Arg Cy3 val Thr Asn 

4 e C 455 470 

agt act cca gca get cag ggg aca ctg tgt caa act ggg aat act gaa 1491 

Ser lie Pro Ala Ala Glu Gly Thr Leu Cys Gin Thr Gly Asn He Glu 

40 4"5 4S0 4S5 45C 

aaa ggg egg tgt tat cag gga gac tgt get cct ttt ggc act egg ccc 1539 

Lys Gly Trp Cys Tyr Gin Gly Asp Cys Val Fro Fhe Gly Thr Trp Fro 

495 500 505 

■: 5 

c^g age -it a g=.: ggg ggc egg gg: ;:; :gg tea era :gg gg.3 gag :gc 15" 

G". n Ser He Asp Gly Hy Trp Gly 7rz Trp Ser I.eu Trp Gly Glu Cys 



11 .i .: :• agg :*c tgc j g gj gg = gg"' gtn r ~c zz\ : ?.; e: 3 3 a ~3~ cgr :? ?. c '. c '•; 

Ser Arg Thr Cys Gly Gly Gly Val Ser Ser Ser Leu Arg His Cys A = p 

525 ' 530 535 

ag: cea gca ec: teg gag gtg gaa aaa cat tge ctt ggg gaa agg aaa 1533 

55 Ser Pre Ala Fro Ser Glu Val Glu Lys Tyr Cys Leu Gly Glu Arg Lys 

c 4 C 5 4 c 5 5 0 

- -i- ----- t-.-;: ri; v::. t^- ? it- ret : : -n- ~ ~~ t '. ~ '■ '. 
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in 



1015 



2 06-7 



: 1 15 



H63 



Lys Tyr Tyr Asn Trp Lys Pre Tyr . :ir Gly Giy Gly Val Lys P:: Cys 

59C 595 £00 

gca tta aac tge teg get gaa get tat aat ttc tac act gaa cgt get 1575 

5 Ala Leu Asn Cys leu Ala Glu Gly Tyr Asn Fhe Tyr Thr Glu Arg Ala 

6C5 61C 515 

cct gcg gtg ate gat ggg acc eag tge aat gcg gat tea ctg gat ate 1923 

Pro Ala Val He Asp Gly Thr Gin Cys Asn Ala Asp Ser Leu Asp He 

1C 620 625 630 

tge ate aat gga gaa tge aag eac gta ggc tgt gat aat att ttg gga 1971 

Cys He Asn Gly Glu Cys Lys His Val Gly Cys Asp Asn He Lieu Gly 

635 643 €45 650 

15 

tct gat get agg gaa gat aga tgt cga gtc tgt gga ggg ggc gga age 

3er Asp Ala Arg G;u Asp Arg Cys Arg Val lys Gly Gly Gly Gly Ser 

655 660 665 

20 aca tgt gat gec att gaa ggg ttc ttc aat gat tea ctg cec agg gga 

Thr Cys Asp Ala He Glu Giy ?he Phe Asn Asp Ser Leu Pro Arg Giy 

670 675 630 

ggc tac atg gaa gtg gtg eag ata tea aga ggc tct gtt cac att gaa 

2 5 Gly Tyr Met Glu Val Val Gin He Pro Arg 31y Ser Val His lie Glu 

635 690 595 

gtt aga gaa gtt gee atg tea aag aac tat att get tta aaa tct gaa 

Val Arg Glu Val Ala Met Ser Lys Asn Tyr He Ala Leu Lys Ser Glu 

30 700 705 71C 

gga gat gat tac tat att aat ggt gee egg act att gae tgg cct agg 2211 

Gly Asp Asp Tyr Tyr He Asn Gly Ala Trp Thr lie Asp Trp Fro Arg 

715 " * 72C 725 730 

35 

aaa ttt gat gtt get ggg ata get :tt cat tac aag aga cca act cat 2259 

Lys ?he Asp Val Ala Gly Thr Ala Phe His Tyr Lys Arg Fro Thr Asp 

725 740 745 

40 gaa cca gaa tee ttg gaa get eta get cct arc tea gaa aat cte ate 23C7 

Giu Pro Glu Ser Leu Glu Ala Leu Giy Fro Thr Ser Glu Asn Leu He 

750 7 5*S 760 

etc atg gtt :tg ctt eaa gaa eag aat ttc gga att agg tat aag ttc 2355 

4 5 Val Met Val Leu Leu Glr. 31 J Gin As:: Leu Hy He Arg Tyr Lys Fhe 

"65 ~H ~ 7 5 

aat gtt ::: it" act cga art gg:: ngt gga :a: aat gaa gtt ggt ttt : 4 'H 

Asn Vai Fro H- Thr Arg Tnr Gly Ser Gly Asp Asn Glu Yai C-i.y rhe 

- ■ ~?.~ 7 z ■ 

aca tgg aat rat cag cet tgg tea gaa tge tea gtt act tgt get gga 14 51 

Thr Trp Asn Hi a Gin Pre Trp Ser Glu Cys Her Ala Thr Cys Ala Gly 

-95 sec =:= sic 

5 5 

cgt aag atg ::: act agg rag err act tag age gca aga tgg aga aca : 4 & 9 

Glv Lys Met Pre Thr Arc Gin Prr Tnr Gin Arc Ala Arg Trp Arg Thr 

= ;- ?2C 925 
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ctt :aa ttaca::a:a :i;ca::tg: tt:c5a;::c atg:aat::a tgcaaatrtg 2H1 
Leu 

3 c ; 

5 

ttggtaaaat acatcctcgs acaatgagtg tctctgctgg :cct:c:ccc aagac:at;: 2711 
tgaaggtggg ctgtttgcct ttcgrgaaca cattcttggt aaagaacatc aaaagzttta 2771 
10 aaaaagaaaa tgagcaagaa tcagacatca cagatgcaac ttcttgtaat gggagatgag 2S31 
gagaatgtac ggctgtg 2 5-3 5 



15 < 2 1 C > = 
<211> :S9 
<212> PRT 

<213> Hor.c sapiens ADAMTS-6 
2 0 < 4 0 0 > 5 

Met Glu lie Leu Trp Lys Thr Leu Thr Trp He Leu 3er Leu He Ket 

1 5 10 15 

Ala Ser Ser Glu Fhe His Ser Asp Hie Arc Leu Ser Tyr Ser Ser Gin 
2D 25 30 

2 5 Glu Glu Fhe Leu Thr Tyr Leu Glu His Tyr Gin Leu Thr lie ?rc lie 

35 43 45 

Arg Val Asp 31 r. Asr. Gly Ala ?he Leu Ser Fhe Thr Val Lys Asn Asp 

50 5 5 €C 

Lys Kis Ser Arg Arg Arg Arg Ser Met Asp Fro He Asp Fro Gin Gin 

3 0 65 7 3 7= 5C 

Ala Va:. Ser Lyi Leu Fhe Fhe Lys Leu Ser Ala Tyr Gly Lys- His ?he 

85 90 S5 

His Leu Asn Leu Thr Leu Asn Thr Asp Phe Val Ser Lys His Phe Thr 
10) 1C5 110 

3 5 Val Glu Tyr Trp Gly Lys Asp Gly Pre Gin Trp Lys His Asp Phe Leu 
115 123 125 

Asp Asn Cys His Tyr Thr Gly Tyr Leu Gin Asp Gin Arg Ser Thr Thr 

130 135 14 C 

Lys Va. Ala Leu Ser Asn Zys Val Gly Leu His Gly Val He Ala Thr 
4C 14 5 15C 15 5 16C 

Glu Asp Glu Glu Tyr Phe lie Glu Pre Leu Lys Asn Thr Thr Glu Asp 

165 170 175 

Ser Lys His Phe Ser Tyr Glu Asn Gly His Fro His Val He Tyr Lys 
131 155 1?D 

45 Lys Ser Ala Leu Gin Hn Arg His Leu Tyr Asp His Ser Hi? Cys Gly 
1?5 2 2 j ICS 

Val Ser Asp Fhe Thr Arg Ser Gly Lys Pre Trp Trp Lei: Asn Asp Thr 

21:: :: z 22:; 

Hu Ll-.. Phe Leu lie His lyr Gin He Asr. Asn Thr His lie H-s His 

r ■: 2 2 2 " t : 5 : 4 - 

Arg 31:; Lys Arg Ser Val Ser He Glu Arg Phe Val Glu Tnr Leu Val 

245 25C 255 

V3I Ala Asp Lys Met '-'e- Val Gly Tyr His Gly Arg Lys Asp He Glu 

2 5 0 2S5 2 H 

55 His Tyr He Leu Ser Val Met Asn He Val AH Lys Leu Tyr Arg As:; 

2^5 2 5 2 2£b 

Ser Ssr Leu Sly As;; VH VH Asn He He Vu.1 Alu Arg Leu lie Val 

2 r - " 2 :j " _ ". J 
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31 y 


Thr 


Leu 


Gly 


Leu 


Ala 


Ser 


val 


Ala 


Gly 


Met 


Cys 


Glu 


Pre 


Glu 






370 










3^5 










3 30 












Arg 


Ser 


Cys 


Ser 


He 


Asr. 


Glu 


Asp 


He 


Gly 


Leu 


Gly 


Ser 


Ala 


Phe 


Thr 




3 S 5 










3?: 










~i 95 










4 C C 


5 


lie 


Ala 


His 


Glu 


He 


Val 


His 


Asn 


Fhe 


Gly 


Ket 


Asn 


His 


Asp 


Gly 


He 












4C5 










413 










4 15 






Gly 


Asn 


Ser 


:ys 


Gly 


Arg 


Lys 


Val 


Met 


Lys 


Gin 


Glr. 


Asr. 


Tyr 


y-v 


Ser 










42C 










425 










430 








Ser 


HIS 


Tyr 


: y s 


Glu 


Tyr 


Gin 


Ser 


Phe 


Fhe 


Leu 


Val 


Cys 


Leu 


Gin 


Ser 


10 






435 










440 










445 










Arg 


Xaa 


His 


His 


Gin 


Leu 


Fhe 


Arg 


Glu 


Val 


Cys 


Arg 


Glu 


Leu 


Trp 


Cys 






450 










'15 5 










46 0 












Leu 


Ser 


Lys 


Ser 


Asn 


Arg 


Cys 


Val 


Thr 


Asr. 


Ser 


He 


Pro 


Ala 


Ala 


Glu 




465 










470 










4^5 










450 


15 


Gly 


Thr 


Leu 


Cys 


Glr. 


Thr 


Gly 


Asn 


lie 


Glu 


Lys 


Gly 


Trp 


Cys 


Tyr 


31 1: 












465 










4 90 










455 






Gly 


A3? 


Cys 


Val 


Pro 


Phe 


Gly 


Thr 


Trp 


Pro 


Gin 


Ser 


He 


Asp 


siy 


31y 










5 0 0 










5 05 










510 








Trp 


GH 






3ci 


L c u 


Tiu 


y 


Glu 






A,g 


Ti.i 


Cys 


=iy 


3ly 


2C 






515 










520 










525 










Gly 


Val 


Ser 


Ser 


Ser 


Leu 


Arg 


His 


Cys 


Asp 


Ser 


Pro 


Ala 


Pro 


3er 


3lu 






53C 










535 










540 












val 


Glu 


Lys 


Tyr 


Cys 


Leu 


Gly 


Glu 


Arg 


Lys 


Arg 


Tyr 


Arg 


Ser 


:ys 


Asr. 




545 










550 










555 










560 


25 


Thr 


Asp 


Fro 


Cys 


Pro 


Leu 


Gly 


Ser 


Arg 


Asp 


Fhe 


Arg 


Glu 


Lys 


Gin 


Tys 












565 










570 










5 7E. 






Ala 


Asp 


Fhe 


Asp 


Asr. 


Met 


Pro 


Fhe 


Arg Gly 


Lys 


Tyr 


Tyr 


Asr. 


Trp 


Ly s 










530 










585 










550 








Fro 


Tyr 


Thr 


Gly 


Gly 


Gly 


Val 


Lys 


Pro 


Cys 


Ala 


Leu 


Asn 


Cys 


Leu 


Ala 


30 






595 










600 










6C5 










Glu 


Gly 


Tyr 


Asn 


Fhe 


Tyr 


Thr 


Glu 


Arg 


Ala 


Pro 


Ala 


Val 


He 


Asp 


Gly 






610 










615 










620 












Thr 


Gin 


Cys 


Asn 


Ala 


Asp 


Ser 


Leu 


Asp 


He 


Cys 


lie 


Asn 


Gly 


Glu 


Cys 




625 










€3 0 










63 5 










64 0 


35 


Lys 


His 


Val 


Gly 


Cys 


Asp 


Asn 


He 


Leu 


Gly 


Ser 


Asp 


Ala 


Arg 


Glu 


Asp 












6 4 5 










650 










6 5 5 






Arg 


Cys 


Arg 


Val 


Cys 


Gly 


Gly 


Gly 


Gly 


Ser 


Thr 


Cys 


Asp 


Ala 


He 


Glu 




















665 










6 70 








Gly 


Phe 


Phe 


Asr. 


Asp 


Ser 


Leu 


Pro 


Arg 


Gly 


Gly 


Tyr 




Glu 


Val 


Val 


40 






675 










63C 










685 










Gin 


He 


Pro 


Arg 


Gly 


Ser 


Val 


His 


He 


Glu 


Val 


Arg 


Glu 


Val. 


Ala 


y.ez 






£93 










695 










7 j ; 












Ser 


Lys 


Asn 


Tyr 


He 


Ala 


Leu 


Lys 


Ser 


Glu 


Gly 


Asp 


Asp 


Tyr 


Tyr 


He 




7 05 










7 : d 










715 










72 0 


45 


Asr. 


3-y 


Ala 


Trp 


Thr 

~'2'z 


lie 


Asp 


Trp 


Pre 


Arg 
73 ; 


Lys 


Pne 


Asp 


Val 


Ala 

H 5 


3 1 y 






a: j 


T'r.o 


His 


Tyr 


Ly J 


Arg 


Pre 


Thr 

"7 4 5 


Asp 


G ' " 


? r"o 


■ : 


Ser 
75 : 


Leu 




- - 


Ala 


Le.i 


" - y 


^41 

Pre 




Se r 


ciu 


A; r. 


Leu 








' ~ r 




If.. 






Glu 


Gin 


A sr. 


Leu 


-y 


He 


Arg 


Tyr 


Lys 


Phe 


As r. 


Val 


Pre 




Thr 


Arg 






7 7 j 










775 










7c C 












Thr 


Gly 


Ser 


Gly 


Asp 


Asn 


Gl J 


Val 


Gly 


Fhe 


Thr 


Trp 


Asr. 


His 


Gln 


Pro 




"55 










79C 










795 










jOC 


5 5 


Trp 


Ser 


Glu 


Cys 


Ser 


Ala 


T h r 


Cys 


Ala 


Gly 


Gly 


lys 


Met: 


Pre 


Thr 


Arg 






















= 10 










f 1 E 






Glr. 




Thr 






A 1 a 


Arg 


Trp 


Arg 


Tnr 


Lys 


His 




Le j 


S c r 


Tyr 
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IJ 



<212> di:a 

<213> Hc-.o sapiens ALAM73-7 

5 <22C> 

<221> CCS 

<222 > (13) . . (3C03; 
<40C> 6 

10 ccggttcctg cc atg ccc ggc ggc ccc agt ccc cgc age ccc gcg cc: ttg 51 
y.ez Pro Gly Gly Fro Ser Pro Arg Ser Pro Ala Pro Leu 
15 1C 

ctg cgc ccc etc etc erg etc etc tgc get ctg get ccc ggc gec ccc 99 
15 Leu Arg Fro Leu Leu Leu Leu Leu Cys Ala Leu Ala Pro Gly Ala Pro 
15 20 25 

gga ccc gca cca gga cgt gca ace gag ggc egg geg cca ctg gac ate 14 7 
Gly Pro Ala Pre Gly Arg Ala Thr Glu Gly Arg Ala Ala Leu Asp lie 
2 0 20 3 5 AO 4 5 

gtg cac ccg get cca gtc gae gcg ggg ggc tee tte ctg tea tac gag IS5 

Val His Pro Val Arg Val Asp Ala Gly Gly Ser Fhe Leu Ser Tyr Glu 
5C 55 63 

25 

etc tag ccc cgc gca ctg cgc aag egg gat gta tec gtg cgc cga gac 243 

Leu Trp Fro Arg Ala Leu Arc Lys Arg Asp Val Ser Val Arg Arg Asp 
55 ?C 75 

30 gcg ccc gec ttc tac gag eta caa tac cgc ggg cgc gag ctg cgc ttc 291 
Ala Pro Ala Fhe Tyr Glu Leu Gin Tyr Arg Gly Arg Glu Leu Arg Phe 
B0 £5 50 

aac ctg acc gec aat cag cac ctg ctg gcg ccc ggc ttt gtg age gag 335 
35 Asn Leu Thr Ala Asn Gin His Leu Leu Ala Fro Gly ?he Val Ser Glu 
5 5 :00 105 

acg egg egg cgc ggc ggc ctg ggc cgc gcg cac ate egg gec cac acc 38~? 
Thr Arg Arg Arc Gly Gly Leu Gly Arg Ala His lie Arg Ala His Thr 
4 0 110 " 115 12C 125 

ccg gec tgc cac ctg ctt ggc gag gtg cag gac eet gag etc gag ggt 435 

Pre Ala Cys His Leu Leu Gly Glu Val Gin Asp Pro Glu Leu Glu Gly 
13C 13 5 14 0 

45 

gee erg gcg gec ate ^g: gec tge gar gge etg uua ggt gig tte cag 4fc3 

Gly Leu Ala Ala lie Ser Ala Cys Asp Gly Leu Lys Cly Val Fhe Gin 
14= 15: 1:E 

51 tee aac gag gae tar tte att gag e. e etg g_. e agt gee teg ge_ : .: - 

Leu Ser Asn Glu Asp Tyr Fhe He Glu Pre Leu Asp Ser Ala Pre Ala 
16 0 * 16 5 17C 

egg cet gge cac gee eag ccc eat gtg gtg tac aac cgt cag gec eeg 579 
55 Arg Pre Gly His Ala Gin Pre His Val Val Tyr Lys Arg Gin Ala Pr: 

17^- ;ec :55 
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619 



6 6" 



915 



96 3 



1C11 



1059 



Arg 31 r. Gin Trp Arg Arg Pro Arg leu Arc At 9 Leu Hie Gin Arg Ser 

225 23C 235 

gtt age aaa gag aag tgg cat gag ice ctg gta gta get gat gee aaa 

5 Val Ser Lys Glu Lys Trp Cys Glu Thr Leu 7a 1 Val Ala Asp Ala Lys 

2 4 0 ' 24 5 2 3C 

atg gtg gag cac cac gga cag ccg cag gtt gag age tat gtg ctg acc 

Met Val Glu Tyr His Gly Gin Fro Gin Val Glu Ser Tyr Val Leu Thr 

10 255 * 260 265 

ate atg aac atg gtg get ggc ctg ttt cat gac rcc age act ?gg aac 

He Met Asn Met Val Ala Gly Leu Fhe Hie Asp Pro Ser He Gly Asn 

210 275 2SC 285 

15 

ccc ate cac ate acc ate gtg cgc ctg etc etg rtg gaa gat gag gag 

Frc He His He Thr He Val Arg Leu Val Leu Leu Glu Asp Glu Glu 

2 90 295 3 C 0 

20 gag gac eta aag ate acc cac cat gca gac aac acr ctg aag age ttc 

Glu Asp Leu Lys He Thr His His Ala Asp Asn Thr Leu Lys £er Phe 

3 3 5 313 315 

tgc aag tgg cag aaa age ate aac atg aag ggg gat gee cat ccc ctg 

25 Cys Lys Trp Gin Lys Ser He Asn Met Lys Gly Asp Ala His Fro Leu 

32C 325 333 

cac cat gac act gee ate ctg etc acc aga aag 3a; ctg tgt gca gee 

His His Asp Thr Ala He Leu Leu Thr Arg Lye Asp Leu Cys Ala Ala 

3 0 33 5 34 3 345 

atg aac egg ccc tgt gag acc etg gga ctg tee :at gtg gee ggc atg 

yet Asn Arg Pro Cys Glu Thr Leu Gly Leu Ser His Val Ala Gly Met 

350 355 360 3£5 

35 

tgc cag ccg cac cgc age tec age ate aac gag gar aeg ggc etg ccg 

Cys Gin Pro His Arg Ser Cys Ser He Asn Glu Asp Thr Gly Leu Pre 

370 375 380 

4 0 ctg gee ttc act gta gec cac gag etc ggg cac agt ttt ggc att cag 
Leu Ala Phe Thr Val Ala His Glu Leu Gly His Ser Fhe Gly He Gin 

385 393 395 

eat gac gga age ggc aat gac tgt gag ccc gtt ggg aaa cga ect ttc 1251 

45 His Aac Glv Ser 31 y A=n Asp Cys Glu Frc Val Gly Lys Arg ?:c He 

A C : 4 H 4 H 

at r at j tet rca cag e t :: : t g tit gar get g- 1 ecc -to ^ - : tgg '- : r • — - 

lie Ser FrL' 3 In le. Leu Tyr Arp AH ,-.H Fro leu Thr Crt Ser 

5 1 HE- H - H - 

cgc tgc age cgc cag tat ate acc agg ttc ctt gac cgc ggg tgg ggc 134" 

Arg Cys Ser Arg 3in Tyr lie Thr Arg Phe Leu Asp Arg Gly Trp Gly 

43C 4 3 5 4 4 3 4 -15 

55 

rtg tar ctg gac gar err. ret gee sag gne att atr gac ttr ere t rg H9: 

Leu Cys Leu Asp ksc Pre He A. a Lys Asp He He Asp Fhe Fro Ser 

4 5 3 4 5 5 4 5 " 



1155 



12 0 E 



4 ^ 1 
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a:a err tgg :cc tec gtg ggg acc a:: tgt cac ::c aag ctg gat g;a 15-29 

Thr Leu Trp Cys Ser Val Gly Thr Thr Cys His Ser Lys Leu Asp Ala 

455 =:c 5 c s 

5 

get gcg gar ggc acc egg :ct ggg gag ar„ aag tgg tgt etc agt ggg IE 5" 

Ala Val Asp Gly Thr Arg Cys Gly Glu Asn Lys Trp Cys Leu Ser Gly 

51C * * 515 520 525 

10 gag tgc gta ccc gtg ggc ttc egg ccc gag gec gtg gat ggt ggc egg 1635 

Glu Cys Val Pro Val Gly ?he Arg Pro Glu Ala Val Asp Gly Gly Trp 

530 535 * 540 

tct ggc tag age gec tgg tec ate tgc tea egg age tgt ggc atg gee 1663 

15 Ser Gly Trp Ser Ala Trp Ser He Cys Ser Arg Ser Cys Gly Met Gly 

5-4 5 55C 555 

gta eag age gec gag egg cag tge acg cag cct acg ccc aaa :ac aaa 1731 

vai nin q<?r Ai? Glu. Arg Glr_ Cys Thr Gin Pre Thr Fro Lys Tyr Lys 

20 560 565 570 

ggc aga tac tgt gtg ggt gag cgc aag cgc ttt cgc etc tgc aae ctg 17^9 

Gly Arg Tyr Cys Val Gly Glu Arg Lys Arg Ph-9 Arg Leu lys Asn Leu 

575 580 585 

25 

cag gee tgc cct get ggc cgc ccc tec etc cgc cac gtc cag tgc age 1627 

Gin Ala Cys Pro Ala Gly Arg Pro Ser ?he Arg His Val Gin Cys Ser 

590 595 SO; 605 

30 cae ttt gac get atg etc tac aag ggc cag ctg cac aca tgg gtg ccc 1875 

His Phe Asp Ala Met Leu Tyr Lys Gly Gin Leu His Thr Trp Val Pro 

610 615 620 

gtg gtc aat gac gtg aae eec tgc gag ctg cae tgc egg eec gcg aat 1S23 

3 5 Val Val Asn Asp Val Asr. Pro Cys Glu Leu His Cys Arg Pro Ala Asn 

6 25 £3 0 ^35 

gag tac ttt gee aag aag ctg egg gac gec tgt gtc gat ggc acc ccc 1971 

Glu Tyr Phe Ala Lys Lys Leu Arg Asp Ala Cys Val Asp Gly Thr Pre 

40 64C 645 650 

tgc tac cag gtc cga gee age egg gac etc tgc ate aac gec ate tgt 2015 

Cys Tyr Gin Val Arg Ala Ser Arg Asp Leu Cys He Asr. Gly He Cys 

655 66 0 655 

45 

dag aac gtg ggc tgt gac ttc gag ?.:: gar tee ggt get .\za gag gac ::-17 

Lys Asr, Val Gly Cys Asp Phe Glu lie Asp Ser Gly Ala ".o: Glu Asp 



5 0 tgc "at cgt gtg tge esc -qr aae ggc ::: ate -qc cae ^c gtg tee 7.'.- 

Arg Cys Gly Val Cys His Gly Asn Gly Ser Thr Cys His Thr Val Ser 
6 9 : 6 5 5 7 0 C 

ggg ace ttc gag gag gec gag ggt ctg ggg tat gtg gat gtg ggg ctg 216. 

5 5 Gly Thr Phe Glu Glu Ala Clu Gly Leu Gly Tyr Val Asp Val Gly Leu 
7 0 5 n 7Z "15 

at r re 3 gcg ^gc gca rcc gag ate r-gc ate caa gag utt gec gag get 27 \'. 
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Asn Gly Gly Trp Thr He Gin Trp Asr. Gly Asp Tyr Gin Val Ala Gly 
~50 755 " " 765 

ace acc t:c aca tac era cgc agg ggc aac tgg gag aa: etc acg tec 23 55 
5 Thr Thr Fhe Thr Tyr Ala Arg Arg Gly Asr. Trp Glu Asn leu Thr Ser 
■77C 775 76C 

ecg ggt ccc acc aag gag cct gtc tgg ate cag gtg cct gec tec cgt 2433 
Pro Gly Pro Thr Lys Glu Pro Val Trp He Gin Val Pro Ala Ser Arg 
10 " 785 790 795 

ggc cea ggc ggg ggg age aga ggc gga etc ccc agg ccc age acc etc 2451 

Gly Fro Gly Gly Gly Ser Arg Gly Gly Val Pro Arg Pro Ser Thr Leu 

8G0 605 an 

15 

cat ggc agg tct cgt cct gga gga gtg age cct ggt tea gtc aca gag 2493 

His Gly Arg Ser Arg Pre Gly Gly Val Ser Pre- Gly Ser Val Thr Glu 

615 620 625 

2C cct ggc tct gag cea ggc cct cct get gcg gec tct acc tea gtt tec 2547 
Pro Gly Ser Glu Pro Gly Fro Pro Ala Ala Ala Ser Thr Ser Val Ser 
83C 335 840 345 

cea tct tta aaa tgg ccc aat ctt gta get gca ctt cac aga jgt gec 2555 
2 5 Fro Ser Leu Lys Trp Pru Asn Lei: Val Ala Ala Val His Arg Gly Gly 
350 855 $62 

tgg ggt caa get cct tta gga ctg ggt gga tgg aga aga cac :tt gtg 2643 
Trp Gly Gin Ala Pre Leu Gly Leu Gly Gly Trp Arg Arg His Leu Val 
30 865 87} 375 

etc acg ggc ccc cgc ctg ccc ace cag ctg ctg ccc cag; gag igc aac 2691 

Leu Met Gly Pro Arg Leu Pro Thr Gin Leu Leu Phe Gin Glu Ser Asn 

660 665 690 

35 

cct ggg gtg cac tac gag tac acc ate cac agg gag gca ggt ggc cac 2 73 9 

Pro Gly Val His Tyr Glu Tyr Thr He His Arg Glu Alu Gly 3ly His 

395 500 5C5 

40 gac gag gtc ccg ccg ccc gtg etc tec tgg cat tat ggg ccc tgg acc 27£7 
Asp Glu Val Pre Pro Pro Val Phe Ser Trp His Tyr Gly Pro Trp Thr 
510 915 920 ' 925 

aag tgc aca gtc acc tgc ggc aga ggt gag aag tgg eg.- agg :ac age 263 5 
4: Ly = Gys Thr Val Thr Gys Gly Arg Gly Glu Lys Trp Gly Arg -lis Ser 

'-.-21 9H 4 J 

cut ace t j c agg ggc v., gtg tct gg.i ::*g ggc- eat tg : ctt et 7 2 « ? :< 

Pre Thr Lys Arg Gly Leu Val Sei Jly Glr. Gly His 7 re Leu -hi L*: ... 

cea get cac tge tgg gee ace acg ggt ttg gaa get tg; tte tct gag 2S31 

Pre Ala His Cy= Trp Ala Thr Thr Gly Leu Glu Val Cy .5 Phe Ser Glu 

9£Z S 65 3~H 

=, c 

ctt cac tte tec ate tgt: jag atg agg eta gcg act gee ctg t gt ccc r. 9" 1 9 

Pre Gin * h>~ S-r He Cy s Jlu v .e: Arg _-_-u Ala He Ala Leu Gyi fie 
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gggeaattte attaacnetg accc::ac:: cceogggegg cego:ic:gagc cgag:du::a 3211 
c:==: 3 2 1 E 

5 

<:n> 99? 

cI12> PR? 
10 ^:i3> Heme sapiens ADAKTS-7 

< 4 C C > 7 

Met Pro Gly Gly Pro 5er Fro Arg Ser Fro Ala Pre leu Leu Arg Pro 
5 1C 15 

15 

Le-u Leu Leu Leu Leu Cys Ala Leu Ala Pro Gly Ala Pro Gly Fro Ala 
20 25 30 

P: e GIv A:g Ala Thr Giu Gly Arg Ala Ala Leu Asp He Vai His Pre 
2 0 35 43 45 

Val Arg Val Asp Ala Gly Gly Ser Phe Leu Ser Tyr Giu Leu Trp Pre 
5 0 5 5 6 0 

2 5 Arg Ala Leu Arg Lys Arg Asp Val Ser Val Arg Arc Asp Ala Fro Ala 
£5 70 75 6C 

Fhe Tyr Giu Leu Gin Tyr Arc Gly Arg Giu Leu Arg Phe Asn Leu Thr 
85 90 95 

30 

Ala Asn Gin H15 Leu Leu Ala Pro Gly Phe Val Ser Giu Thr Arg Arg 
10C 105 11C 

Arg Gly Gly Leu Gly Arg Ala His He Arg Ala His Thr Pre Ala Cys 
35 115 120 125 

Hi 3 Leu Leu Gly Giu Val Gin Asp Pre Giu Leu Giu Gly Gly Leu Ala 
130 135 14C 

4 0 Ala :le Ser Ala Cys Asp Gly Leu Lys Gly Val Phe Gin Leu Ser Asn 
145 150 155 150 

Giu Asp Tyr Phe He Giu Pro Leu Asp Ser Ala Pro Ala Arg Pro Gly 

16 5 17: 1" 

4 5 

His Ala Glr. Pro His Yal Val Tyr Lys Arg Gin Ala Fie Giu Arg Lej 

IcC 155 150 

Ala Gin. Arg 31 y hsz Ser Ser Ala --r:: 5e: Thr Cys Gly Val Jl:\ Val 

e: : • : : : . 1 

Tyr Pre Giu Leu Giu Ser Arg Arg Giu Arg Trp Giu Gin Arg Gin Gin 
210 215 22 C 

55 Trp Arg Arg Pre Arg Leu Arg Arg Leu Hi 5 Gin Arg Ser Val Ser Lys 
2 2= 230 23; 2 : 
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lie ,:.r lie Val Arg Leu Val Leu Le - 31 u Asp Glu 31 - Glu Asp Leu 

2=0 2 ? 5 j j C 

Lys lie Tr.r His His Ala Asp A sr. Tr.r Leu Lys Ser Fhe Cys Lys Trp 
5 305 31C 3 15 323 

Gin Lys Ser He Asr. Met Lys Gly Asp Ala His Pre Leu His His Asp 
325 330 335 

10 Thr Ala lie Leu Leu Thr Arg Lys Asp Leu Cys Ala Ala Met Asn Arg 
340 345 35C 

Fro Cys Glu Thr Leu Gly Leu Ser His Val Ala Gly Met Cys Gin Pre 

3E5 360 36E 

15 

His Arg Ser Cys Ser He Asr. Glu Asp Thr Gly Leu Pre Leu Ala Fhe 
370 375 380 

Tit- Val Ala His Glu Leu Gly Hii 5ei Fhe Gly lie Gin His Asp Gly 
20 335 390 395 400 

Ser Gly Asn Asp Cys Glu Pro Val Gly Lys Arg Fro Phe He Yet Ser 
4C5 410 415 

2 5 Fro Gin Leu Leu Tyr Asp Ala Ala Pre Leu Thr Trp Ser Arg Cys Ser 

420 425 42C 

Arc Glr: Tyr He Thr Arg Phe Leu Asp Arg Gly Trp Gly Leu Cys Lej 
435 4 40 445 

3 0 

Asp Asp Pro Fro Ala Lys Asp He He Asp Phe Fro Ser Val Fro Pro 
450 455 460 

Gly Val Leu Tyr Asp Val Ser His Gin Cys Arg Leu Gin Tyr Gly Ala 
35 455 470 475 480 

Tyr Ser Ala Fhe Cys Glu Asp Met Asp Asn Val Cys His Thr Leu Trp 
485 49C 495 

4C Cys Ser Val Gly Thr Thr Cys His Ser Lys Leu Asp Ala Ala Val Asp 
5 00 5 05 510 

C-iy Thr Arg Cys Gly Glu Asn Lys Trp Cys Leu Ser Gly Glu Cys Val 
515 5 20 52 5 

4 5 

I'll- Val Gly Fhe Arg Fro Glu Ala Val Asr Gly Gly Trp Ser Gly Tip 
5 3 0 =35 54 0 

Ser Al 3 Try Sei 1 1- Cys Ser Arg Ser Cy~ Gly Met Gly Val Cln Ser 

" ~ z -i z - ^ z ' I 5 r t ? 

Ala Glu Arg Gin Cys Thr Gin Fro Tr.r Fro Lys Tyr Lys Gly Arg Tyr 

56 = 5^C 5^5 

55 Cys Val Gly Glu Arg Lye Arg Fhe Arg Leu Cys Asn Leu Gin Ala Cys 

5~D E65 5 : 

- ' " Al 1 :-lv An T - - : .7 H . r- Val ?-\- Se: Hi-' ' • • Arr 
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Ala Lys Lys Leu Arg Asp Ala Cys Val Asp Gly Thr Pro Cys Tyr Gin 
6 4 5 6 5 C t E 5 

Val Arg Ala £er Arg Asp Leu Cys lie Asn Gly lie Cys Lys A.sn Val 
5 66C 665 ' £70 

Gly Cys Asp Phe Glu He Asp Ser Gly Ala Met: 31u Asp Arg Cys Giv- 
ers * 663 685 

10 Val Cys His Gly Asn Gly Ser Thr Cys His Thr Val Ser Gly Thr Phe 

£ 9 C 6 9 5 7 G C 

Glu Glu Ala Glu Gly Leu Gly Tyr Val Asp Val Gly Leu He Pro Ala 
7C5 *?! C 715 72C 

15 

C-ly Ala Arg Glu He Arg He Gin Glu Val Ala Glu Ala Ala Asn Phe 
725 73C 735 

Lcj Ala Leu Aig St: Glu A^jj ?lu Glu Lvb Tyr Phe Leu Asii Gly Gly 
20 74G 745 750 

Trp Thr lie Gin Trp Asn Gly Asp Tyr Gin Val Ala Gly Thr Thr Phe 
755 760 765 

2 5 Thr Tyr Ala Arg Arg Gly Asn Trp Glu Asn Leu Thr Ser Pro Gly Pro 

77C 7 75 78 C 

Thr Lys Glu Pro Val Trp He Gin Val Pro Ala Ser Arg Gly Pro Gly 
735 790 795 8CG 

3C 

Gly Gly Ser Arg Gly Gly Val Fro Arg Pro Ser Thr Leu Kis Gly Arg 
8 C E 5 1 C 815 

Ser Arg Fro Gly Gly Val Ser Fro Gly Ser Val Thr Glu Pro Gly Ser 
35 " 820 ' £2*5 830 

Glu Pro Gly Pro Pro Ala Ala Ala Ser Thr Ser Val Ser Pro Ser Leu 
335 840 545 

4 0 Lys Trp Pro Asn Leu Val Ala Ala Val Kis Arc Gly Gly Trp Gly Gin 
850 3 55 86 0 

Ala Pro Leu Gly Leu Gly Gly Trp Arg Arg His Leu Val Leu Me: Gly 

6 65 37 0 S7E ES0 

4 5 

r r: Arg Leu 1 : : Thr Hn Le^ i r.~ Gin J 1 i';:" A;r. i r: Gly "HI 

- = r - : 

Hi.- Ty: : . . Tyr Thr IH Hi-" Arg Glu AH Hy Gly Hi.v As: Hu Val. 

Ho Pro Pro val Phe Ser Trp His Tyr Gly Pro Trp Thr Lys Cys Thr 
915 920 925 

55 Val Thr Cys Gly Arg Gly Glu Lys Trp Gly Aug His Ser Pro Tnr Cys 
930 S40 

Aru : eu Val v - ~ 1 r. ~". v H •. ~ Trr : ~u ~1 - :.<■-: P r -• A". H > 
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Hy Arc Val His Gly 
99S 



5 

c2::> s 

<2il> 3633 
<212> DNA 

<213> Mus musculus AuCAKTS-8 

10 

<22C> 
<221> CCS 

c222> (273) . . (2992; 
15 c4CC:> 6 

tagggcgact gcacgggacg ccgcggagga cgcgccctcg cggcccgggg cgcoaegtge 6C 

tcgagttetg ctaggttggc tggcgcagga ggagcggget gcgcgatcca gaggggccgc 120 

20 cagggaccgc cgcgccacgt gccgctagec gagtcggccc ccccatccga ttgatcattt ISC 

ttcctggaca gagcgacceg gccgcctcgg gccaecagca cctgcccgcg cgcggcgatc 24 0 

ucttccct: tcccgcgctc cgcagcactc tzccccc atg etc cgc gac ccc acc 295 
2E Met Leu Arg Asp Pro Thr 

1 5 

acc ac: ggg tgg ccg ccc etc ctg ctg ctg eta ttg cag etc ccg ccc 343 
Thr Thr Gly Trp Fro Pre Leu Leu Leu Leu Leu Leu Gin Leu Pro Pro 

30 10 15 20 

ccg cca etc gtc tgc gga gec ccg gcg ggg ccg gga acc ggc 3 eg cag 3 91 
Pre ?:d Leu Val Cys Gly Ala Pro Ala Gly Fro Gly Thr Gly Ala Gin 

25 32 35 

35 

ccc :c: gag eta gtg ctg ccc acg cgc ttg ccc ggc age gcg age gag 43 S- 
Ala Ser Glu Leu Val Val Fro Thr Arg Leu Fro Gly Ser Ala Ser Glu 

4 : 45 5 ci 

40 etc gc: ttc cac ctg tec ccc ttc ggc cag ggc ttc gtg ctg cgc ctg 46" 
Leu Ala Fhe His Leu Ser Ala Fhe Gly Gin Gly Fhe Val Leu Arg Leu 
5 5 6 0 65 70 

gcg cct gac gec age ttc ctg gcg ttg gaa ttc aag ate gag cgc etc ^3E 
45 Ala Pre Asp Ala Ser The Leu Ala Pre Glu Fhe Lys He Glu Arg Leu 
"5 :C 3 = 

= age gcg gtg g:c ggg ggc gag erg gga ctg rgt ggc -ge CcJ 

Gly Gly Ser -er Ala Ala Al.a Gly G.y Glu .-ro Gly Lej Arg Gly ."ye 

5 ? : - ~ \ ■: 

ttc ttc cct ggc aca gtg aat gga gaa egg gag teg ctg gcg geg atg t :■ 1 

Phe Phe Ser Gly Thr Val Aen Gly Glu Arg Glu Ser Leu Ala Ala Xe: 
1C5 110 " 115 

55 

age tg: gtc gcg ggc teg age ggc teg ttc ttg ctg gca ggc gag gag 6 " ? 

e'er Cys Val Ala Gly Trp Ser Gly Ser Phe Leu Leu Ala G.y Glu CO.; 
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got gee gee gaa gtt etc oer ct: cet caa gga erg gag tgg gag gtg =23 

Ala Ala Ala 31u Val Phe Fro Leu Pre 31r, Gly Leu Glu Trp Slu 7a I 
17 3 17 3 ISO 

5 

gag atg ggt aat gcg cag gga cag gag aga agt gac aac gaa gag gac E" 1 ! 

31u Met Glv Asn Glv Gin 31y Glr. Glu Arc Ser Asp Asn Glu Glu Asp 

165 " ' 19: 195 

10 aag aag cag gac aac gag gcg teg etc aaa gag aca gaa gac tec :gc 519 

Lys Lys Gin Asp Lys Glu Gly Leu Leu Lys Glu Thr Glu Asp Ser Arg 
20C 2C5 210 

aaa gtg cca cca ccc ttc 30a tec aaa act aga age aag agg ttc gtg 557 

15 Lys Val Pro Frc Fro Fhe 31y Ser Lys Thr Arg Ser Lys Arc Fhe 7al 

215 22C ' 225 220 

tec gag get cgc tte gtg gaa aca ctt etg gtg get gat gcg tec atg 1215 

See Glu Ala Arg Phe- Val 31u The Leu Leu Val Ala Asp Ala Ser Meu 

2C 235 240 245 

get gee ttc tat ggg ace gac etg cag aac cae ate etc acg gtg atg 1063 

Ala Ala Phe Tyr Gly Thr Asp Leu Gin Asn Hie lie Leu Thr Val Met 

250 ' * 255 26C 

25 

tea atg cca gec cga ate tac aag cae ccg age ate agg aac tec etc 1111 

Ser Met Ala Ala Arg He Tyr Lys His Pre Ser He Arg Asn Ser Val 

265 270 275 

30 aac ctt gtg gtg gtg aaa gtg eta aca gtg gaa aaa gaa aga tgg gge 1155 

Asn Leu Val Val Val Lys Val Leu He Val Glu Lys Glu Arc Trp Gly 
25C 285 29D 



ccg gaa gtg tec gac aac gcg ggg etc aca etg cgc aac ttc tgc age 

3 5 Fro Glu Val Ser Asp Asn 31 y Gly Leu Thr Leu Arg Asn Phe Cys Ser 

295 200 105 310 

tgg caa egg egt ttc aac aag ccc agt gac cgc cac ccg gag eac tat 

Trie Gin Arq Arg Fhe Asn Lys Pro Ser Asp Arg His Fro Glu His Tyr 

4C * " 315 320 325 

gac act gee ate ttg ttc ace aga cag aac ttc tgt ggg aag gga gag 

Asp Thr Ala He Leu Phe Thr Arg Gin Asn Phe Cys Gly Lys Gly Glu 

3 3 0 3 3 5 3 4 C 

4 " 

-a? tgt gac acr etg ggg atg gca gac gtr. pgr ace ate tgt gac ccc 

y.r. 2vs Asp Thr I «u Glv .•'et Ala Asp Val Gly Thr He Gys Asp P r 0 

' ■ - ■ 1H 

?: ]n: aa-g age tgr tea gr g i: c aag gat gag gga cag g~a g-r rr 

Asp Lys Ser Cys Ser Val He Lys Asp Glu Gly Leu Glu Ala Ala Tyr 

360 36 5 3 70 

acc etg gee eat gag eta ggg eac gtt etc age ate ccc cat gat gat 

55 Thr Leu Ala His Glu Leu Gly His Val Leu Ser Met Fro His Asp Asp 

3-5 = 0 : = T; i t j 

r ~.=>.a >~^~ rzsr r:-. .7 - * - r~~. -cc r~- a-:: uoc a=»~ t a- ~ac a t -j 
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Cys 5er Ala Val Tyr Leu Thr 31- Leu leu Asp Asp Gly His 31 y Asp 

4 2 5 4 3j 4 35 

tgt c:: etc gat ccc cc; acc tec at: ctg ::c etc ccc aca ggc etc 1539 

5 Cys Leu Leu Asp Ala Pre Thr Ser Val Leu Fro Leu Pre Thr Gly Leu 

4 4 0 443 450 

ccg ggc cac age acc etc cac gag ctg gac cag cag tgc aag cag ate 1537 

Fro Gly His Ser Thr Leu Tyr Glu Leu Asp Gin Gin Cys Lys Gin lie 

10 455 460 * 465 473 

ttt ggg ccc gat ttc cga cac tgc ccc aac acc tec gtg gag gac acc 1735 

Fhe Gly Pre Asp Fhe Arg His Cys Pro Asn Thr Ser Val Glu Asp lie 

475 460 4S5 

15 

tgt gtc cag etc tgt gec cgt cat egg cat age gat gag ccc att tgc 17B3 

Cys Val Gin Leu Cys Ala Arg His Arg Asp Ser Asp Glu Pro He Cys 

4 90 " 495 " 50 0 

20 cac aca aag aac ggt acc ctg ccc tgg get gat ggt aca ccc tgt ggc 1331 

His Thr Lys A3n Gly Ser Leu Leu Trp Ala Asp Gly Thr Pro Cys 31y 

505 51C 515 

cet egg cac etc tgc ctg gat g=t age tgt gta etc aag gag gat gtg 1379 

25 Pro Gly His Leu Cys Leu Asp Gly Ser Cys Val Leu Lys Glu Asp Val 

520 ' 525 530 

gag aac ccc aag gec gtg gta gat gga gac egg ggt ccc tgg aga ccc 1927 

Glu Asn Pro Lys Ala Vai Val Asp Gly Asp Trp Gly Fro Trp Arg Pro 

30 535 540 545 550 

"33 99 a caa tgt tct cgc acc tjt ggt gga ggg ata caa ttc teg jac 1975 

Trp Gly Gin Cys Ser Arg Thr C/s Gly Gly Gly lie Gin Phe Ser Asn 

555 560 " 565 

35 

cgt gaa tgt gat aat cca atg c;t cag aat cga gga aga ttt tgc etc 2023 

Arg Glu Cys Asp Asn Pro Met Pro Gin Asn Gly Gly Arg Phe Cys Leu 

570 575 530 

40 ggt gaa aga gtc aag tac caa t :a tgc aac aca gac gaa tgt cca cca 107: 

Gly Glu Arg Val Lys Tyr Gin Ser Cys Asn Thr Glu Glu Cy3 Pro Pro 

585 530 " 535 

aac gga aaa age tec cgc gag cac cac cgt gag aaa tat aat ccc tac 1119 

45 Asn Gly Lys Ser Phe Arc Glu Gin Gin Cys Glu lys Tyr Asn Ala Tyr 

5 0 1 - : 5 i 1 C 

aac cac act cac etc gac egg a .it ttc ctg cap t.ce gtc 33c tat . : ( " ~ 

Asn His Thr Asp Acr Gly Asn Phe Leu Gin Trp Val Pre Lys Tyr 

e : 6 1 5 6 ; : e 2 5 c 

tea gga gtg tec ccc cga cac cga tgc dag ctg ttt egc aga gec cgt 1215 

Ser Gly Val Ser Pre Arg Asp Arg Cys Lys Leu Fhe Cys Arg Ala Arg 

635 640 645 

5 5 

399 599 23" 323 = 3-9 ttt gaa get aag gtg ate gat ggc act zz-:\ 

Gly Arc Ser Glu Phe Lys V a 1 Fhe Gl- Ala lys V^l He Asp Gly Thr 

zzZ ' 655 ' 661 
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-9- 393 9-9 tgt ggg 99: aaa ggc a-- =c: tgt agg aag a— ggt 24C~ 

Cys Gly Val Cys Gly Gly Lys Gly Tnr Ala Cys Arc Lys :le Ser Gly 
69 5 70 0 7 C 5 71 C 

5 

tct etc acc c:: age cat ggc ca: aa: gac ate gtc acc ate era 245? 

Ser Fhe Thr Pre ?ne Ser Tyr Gly Tyr Asn Asp He Vai Thr He Pro 
715 720 725 

10 get ggt gcc aca aac att gat gtg aaa cag ccg act cac cca ggg gtc 2=03 

Ala Gly Ala Thr Asn lie Asp Val Lys Glr. Arg Ser His Pro Gly Val 

73J * 73J 740 

agg aac gac gg: age tac ctg gcg eta aag aca gcc aat ggg rag tac 2551 

15 Arg Asn Asp Gly Ser Tyr Leu Ala Leu Lys Tnr Ala Asn Gly 31 n Tyr 

745 75C 755 

ccg etc aat ggt aac ctg gee at: tct gcc ata gag caa gac ate ttg 2559 

Leu Leu Acn Civ Atn Leu Ala lie Ser Ala lit; Giu Gin Asp lie ^eu 

20 7£G ' 765 770 

gtg aag ggg ac: ate ctg aag ta: agt ggc tec atg get acc rtg gag 2c47 

Val Lys Gly Thr He Leu Lys Tyr Ser Gly Ser Met Ala Thr Leu Glu 

775 730 735 790 

25 

egg ctg cag age ttc eag gee ctg cct gag cct ctt aca gta :ag etc 2655 

Arg Leu Gin Ser ?he Gin Ala Leu Pro Glu Pro Leu Thr Val Gin Leu 
755 300 305 

30 ctg act gtg tct get gag gtc tte cct cca aaa gte aga tat ace ttc 2743 

Leu Thr Val Ser Gly Glu Val Phe Pro Fro Lys Val Arg Tyr Thr Pr.e 
613 ' 615 320 



ttt gtc ccc aat gac atg gac ttc age gtg cag aat age aag gaa aga 
3 5 Phe Val Fro Asn Asp Met ^sp Phe Ser Val Gin Asn Ser Lys Glu Arg 
325 * 8 3 0 635 



gga gac tag tct gaa tgt ccg age acg tge aga ggt age tgg eag egg 

Gly Asp Trp Ser Glu Cys Pre Ser Thr Cys Arg Gly Ser Trp Gin Arg 

£55 560 305 fc7C 

4 5 

egg act gtg gad tge -gg gar ere t ca ggt eag gee tet gac acc tgt 

Ara Thr Val Glu Cys Arg Asp Pre Ser Giy Sin Ala Ser Asp Thr Cys 

5 =e: 33 5 

5C gat gag get re aaa cct gag g^t gee =,ag ccc tgt gga age cag ccg 

Asc Giu Ala Leu Lys Pre Glu Asp Ala Lys Pro Cys Gly Ser Gin Pre 

3?2 5 95 5-c c 

tgt ccc etc tgatccectt ggtggaaatc tcctaggctt atggatttgg 

5 5 Cys Pre Leu 
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gta ace acc aac ate att cag tea ctg ccc tct geg gag tgg gtt ctg 2639 
Ala Thr Thr Asn lie He Gin Ser Leu Pre Ser Ala Glu Trp Val Leu 
4 0 34 C 845 350 
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t:;::::c:dt trggtr cccc caataatrs; c::a::::ac ;g~ggggaaa aa:cacra:a 33?: 

caacagg:ac aacgccagcr gngguagcg aacgcuaaag :aagc:ccai aggtatctur 3 45 2 

5 

aagut. arc: t cagaaatg::: cguggctgut: t^caguatta aaatcuctcg tcnasaaggg 3512 

cagcagugtc ca:ca:accg ttatagaaag ccacttttct caggctgcca cccgccgggg 3572 

10 cggacccatit tcaagtattt atgcaaatat gtctccgaac taaagcctgt cttacaccaa 3632 

aagr.gc 363 = 



15 <21C> 3 

<211> 915 

<212> FRT 

<213> Xus rr.usCL.lus AEAXTS - 6 

2 0 <4 0 0> 9 

Mei Leu Arg Asp Fro Thr Thr Thr Gly Trp Pre Pre leu Leu Leu Leu 
I 5 13 15 

Leu Leu Gin Leu Fro Pro Pre Pro Leu Val Cys Gly Ala Pro Ala Gly 
25 2C 25 33 

Fro Gly Thr Gly Ala Gin Ala Ser Glu Leu Val Val Fro Thr Arg Leu 
3 5 4 0 4 5 

3 0 Fro Gly Ser Ala Ser Glu Leu Ala Phe His Leu Ser Ala Fhe Gly Glr. 

50 55 60 

Gly Phe Val Leu Arg Leu Ala Fro Asp Ala Ser Phe Leu Ala Pro Glu 

65 70 75 8 0 

35 

Fhe Lys lie Glu Arg Leu Gly Gly Ser Ser Ala Ala Ala Gly Gly Glu 

85 50 95 

Fro Gly Leu Arg Gly Cys Phe Fhe Ser Gly Thr Val Asn Gly Glu Arg 
40 1C0 105 11C 

Glu Ser Leu Ala Ala Kec Ser Cys Val Ala Gly Trp Ser Gly Ser Fhe 
115 120 125 

45 Lc-u Leu Ala Gly Gl u Glu Phe Thr lie Glr. Fr: Glr: Gly Ala Gly As? 
13 0 13 5 14 C 

S •:• r Leu Asp Glr Fr :• His Arg Leu Glr. Arg Trp Gly Pre- G 1 y Glr Arg 
14 5 * 15 J 155 -0- 

Arg Glu Asc Frc 31.' Leu Ala Ala Ala Glu Val Phe Pro Leu Pro Glr. 

165 1"J 175 

Gly Leu Glu Trp Glu Val Glu V.et Gly Asn Gly Gin Gly Gin Glu Arg 
5 5 1 9*: 1 35 1 5 1 

Ser Asp Asn Glu Glu Asp Lys Lys Gin Asp Lys Glu Gly Leu Leu Lys 
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245 250 255 

His He Leu Tr.r Val Met Ser Met Ala Ala Arc He Tyr Lys His ?rc 
26: 265 2^C 

5 

Ser He Arg Asr. Ser Val Asn Leu Val Val Val Lys Val Leu He Val 
275 260 235 

Glu Lys Glu Arg Trp Gly Fro Glu Val Ser Asp Asn Gly Gly Leu Thr 
10 293 295 300 

Leu Arg Asn Phe Cys Ser Trp Gin Arg Arg Fhe Asr. Lys Pro Ser Asp 
3 05 310 315 32 0 

15 Arg His Fro Glu His Tyr Asp Thr Ala He Leu Phe Thr Arg Gin Asn 

22 5 330 335 

?he Cys Gly Lys Gly Glu Gin Cys Asp Thr Leu Gly Met. Ala Asp Val 
240 345 350 

20 

Gly Thr He Cys Asp Pro Asp Lys Ser Cys Ser Val He Lys Asp Glu 
355 363 365 

Gly Leu Gin Ala Ala Tyr Thr Leu Ala His Glu Leu Gly His Val Leu 
25 ' 3 7C 3 75 38 0 

Ser Met Pro His Asp Asp Ser Lys Pro Cys Val Arg Leu Fhe Gly Pro 
3E5 3 90' 395 400 

3 0 Met Gly Lys Tyr His Met Xet Ala Pro Phe Fhe He His Val Asn Lys 

405 410 415 

Thr Leu Pro Trp Ser Pro Cys Ser Ala Val Tyr Leu Thr Glu Leu Leu 
4 2 C 425 450 

35 

Asp Asp Gly His Gly Asp Cys Leu Leu Asp Ala Fro Thr Ser Val Leu 

4 35 440 445 

Pre Leu Fro Thr Gly Leu Pre Gly Kis Ser Thr Leu Tyr Glu Leu Asp 

4 0 4 5C * 4 55 46 C 

Gin Gir. Cys Lys Gin He Phe Gly Pro Asp ?he Arg His Cys Fro Asn 
4S5 * 470 4^5 48C 

45 Thi Ser VH GH Asp He Cys Val Gin Leu Cys Ala Arg H : s Arg Asp 

4 5 5 4 H 4 5 5 

Hi Asl. Glu r:: He Sys H= Ihr Lys At:". Sly C.r Le„ It- Hp Ala 

5:: ' 5:5 =:■: 

Z f ■ 

Asp Gly Thr Pre Cys Gly Fr: Gly His Leu Cys Leu Asp Gly Ser cys 

5 1 5 521- 52 5 

Val Leu Lys Glu Asp Val C-lu Asn Fr~ Lys Ala Val Val Asp Gly Asp 

5 5 53: 33 5 54 3 

Trp Gly 7 : Trp Arg Fro Trp Sly C-ln Cys Ser Arg Thr Cys Sly Gly 
c z ^ - " 5 5 5 5 c ' 
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5 9 5 6 C 0 5 C 5 

Glu Lys Tyr Asr. Ala Tyr Asn His Thr Asp Leu Asp Gly Asn Phe Leu 
610 615 62C 

5 

C-ln Trp Val Fro Lys Tyr =er Gly Val Ser Pro Arg Asp Arg Cys Lys 
625 6 3 0 63 5 64C 

Leu Phe Cys Arg Ala Arc Gly Arg Ser Glu Phe Lys Val Fhe Glu Ala 
10 6 45 ~ 65 0 " 655 

Lys Val He Asp Gly Thr Leu Cys Gly Pro Asp Thr Leu Ser He Cys 
£60 665 670 

15 Val Arg Gly Gin Cys Val Lys Ala Gly Cys Asp His Val Val Asn Ser 
575 eao 665 

Fro Lys Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Thr Ala 

6 9 C 69 5 7C0 

2 0 

Cys Arg Lys He Ser Gly Ser Fhe Thr Pro Fhe Ser Tyr Gly Tyr Asr. 
7 C 5 71C 715 720 

.Asp He Val Thr He Fro Ala Gly Ala Thr Asn He Asp Val Lys Gin 
25 " 725 73 C 735 

Arg Ser His Pro Gly Val Arg Asn Asp Gly Ser Tyr Leu Ala Leu Lys 
74C 745 750 

3 0 Thr Ala Asn Gly Gin Tyr Leu Leu Asn Gly Asn Leu Ala He Ser Ala 

755 760 765 

lie Glu Gin Asp He Leu Val Lys Gly Thr He Leu Lys Tyr Ser Gly 

77 0 ^75 780 

35 

Ser h'.et Ala Thr Leu Glu Arg Leu Gin Ser Phe Gin Ala Leu Pre Glu 
735 79C 795 60G 

Fro Leu Thr Val Gin Leu Leu Thr Val Ser Gly Glu Val Phe Pre Pro 
4C 505 310 815 

Lys Val Arg Tyr Thr Fhe Fhe Val Pro As:; Asp Xez Asp Fhe Ser Val 
S20 £2 5 83 0 

4 5 ■ . :: As:: Ser Lys Gl _ Arg Ala Thr Thr Asn He He Gin Ser Leu Fr; 

5 :• = 5 n s -j 5 

Ser AH Glu Tip V~ ". L*u Gly Asp Trp ie: Glu Cys Fr: Ser Tr.r Cy = 

±zZ ' =5: 

7. - 

Arg Gly Ser Trp Gin Arg Arg Thr Val Glu Cys Arg Asp Pro Ser Gly 

S65 97 C 375 53 3 

Gin Ala Ser Asp Thr Cys Asp Glu Ala Leu Lys Pre Glu Asp Ala Lys 

55 S=5 69: 595 

Pro Cys Gly £ti Gin Pro Cys Pro Lc 
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c2::> 2 . . i : 
5 <^;:> 

eg agg gca gaa ggc get age gag ccg cca ccg ccc ctg ggg gec acg 
Arg Ala Glu Gly Ala Ser Glu Fro Pro Pro Pro Leu Gly Ala Thr 
15 10 15 

10 age agg acc aag egg tt: gtg tct gag gcg cgc :tc gtg gag acg ctg 
Ser Arg Thr Lvs Arg Fhe Val Ser Glu Ala Arg Phe Val Glu Thr Leu 
20 25 30 



55 



ctg gcg :cc gat gcg ccc acg get gec tt: tac ggc gee gac ccg cag 143 

15 Leu Val Ala Asc Ala Ser Met Ala Ala Fhe Tyr Gly Ala Aap Leu Glr. 

3 5 4? 45 

aac cac ate ctg acg eta atg tct gtg gca gec cga ate cac aag cac 191 

ASH His l-e Leu Tar Leu Met Ser Val Ala Ala Arg lie Tyr Lys His 

20 50 55 60 

ccc age ate aag aat tec ate aac ctg atg gtg gta aaa gtg ctg ate 219 

Pre Ser lie Lys Asn Ser He Asn Leu Met Val Val Lys Val Leu lie 

65 70 75 

25 

gta gaa gat gaa aaa tgg ggc cca gag gtg tec gac aat ggg ggg etc 2=;7 

Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly Leu 

8C 85 90 95 

3C aca ctg egt aac etc tgc aac tgg cag egg cgt ttc aac cag ccc age 3" 5 

Thr Leu Arg Asn Phe Cys Asr. Trp Glr. Arg Arg Fhe Asn Gin Frc Ser 

ic: * 105 110 

gac :g; ;ac cca gag cac tac gac acg gec ate ctg etc ac: aga cag 3t3 

3 5 Asp Arg His Pre Glu His Tyr Asp Thr Ala He Leu Leu Thr Arg Gin 

115 12C 125 

aac ttc tgt ggg cag gag ggg ctg tgt gac acc ccg cgt gtg gca gac 4 '■ 1 

Asn Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala Asp 

4C 130 * 135 14 0 

at - 939 3CC a -"- tgt gac ccc aac aaa age tgc tec gtg at: gag gat 475 

lie Glv Thr He Cys Asp Pro Asn Lys Ser Cys Ser Val He Glu Asp 

1 4 E :. 5 C 15 5 

4 E 

gag ggg etc cag gcg gee cac acc ctg gcr cat ga^ eta gg^ cac gte EI" 

Glu Glv Leu Gin Ala Ala Hie Thr Leu Ala His Glu Leu Gly His 

:c: :c= i~: 

-Z etc age itg ::c cac gac -gar tec -wj cc. tgc y.u egg etc tec g j j i' - 

Leu Ser Vet Fro His Asp Asp Ser Lys Pre Cys Thr Arg Leu Phe Gly 

15 0 15 5 19C 

ccc atg urge aac cac cac gtg atg gca ccg ctg ttc gtc cac ctg aac t23 

5 5 Pre Met Glv Lys His His Val Ve_ A: a Pre Leu Phe Val His Leu Asn 

19' 2:0 1C5 
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Leu Lys Cys Asp Leu .Vet 
24 C 24 5 



5 <21C> II 

<2i:> 245 

<212> ?RT 

<213> Homo sapiens ADAMTS-8 

10 <icc> 11 

Arg Ala Glu Gly Ala Ser Glu Pro Pro Fro Frc Leu Gly Ala Thr Ser 
: s 10 15 

Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr Leu Leu 
15 23 25 33 

Val Ala Asp Ala Ser Vet Ala Ala Phe Tyr Gly Ala Asp Leu Gin Asn 
35 40 45 

2 0 His lie Leu Thr Leu Met Ser Val Ala Ala Arg lie Tyr Lys His Pro 

50 55 60 

Ser lie Lys Asn Ser He Asn Leu Vet Val Val Lys Val Leu He Val 
6 5 7C 7 5 8C 

25 

Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly Leu Thr 
6 5 90 9 5 

Leu Arg Asn Phe Cys Asn Trp Gin Arg Arg Phe Asn Gin Pro Ser Asp 

3 0 100 10E 110 

Arg His Fro Glu His Tyr Asp Thr Ala He Leu Leu Thr Arg Gin Asn 
115 120 125 

35 Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala Asp He 
120 " 135 140 

Gly Thr He Cys Asp Pre Asn Lys Ser Cys Ser Val He Glu Asp Glu 
145 150 155 16C 

4 0 

Gly Leu Gin Ala Ala His Thr Leu Ala His Glu Leu Gly His Val Leu 
16 5 170 175 

Ser Me: Pre His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe Gly Pre 
■H ! = 1 £ 7 : r C 

Me I Cly Lys His s Val Vet. Ala Fr 1 Leu Val His Leu Asn Gin 

17 T'r.i Hi Ti: Hi Fr:. I've 2 c r A7 3 Fr.e 7er Gly 1'ys L^_: 

1IC 215 22 7' 

Gin Gly Trp He His Phe Lys Tyr Leu Cys lys Cys Val Ser Glu Leu 
22 5 23 0 2 25 24 0 

55 

Lys Cys Asp Leu "el 
24 5 
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<22i> cls 

< 2 2 2 > : 2 6 4 = : 

< 2 2 0 > 

5 <2'21> rr.ise_feature 

<222> (1563) . . {25 65) 

<221> rr.isc_f eature 
10 <I22> 11404) .. (1406) 

< 4 0 0 > 12 

ga age a;: atg cag tit gta :cc tgg gec aca ctg eta aeg etc etg 47 

3er Thr Met Gin Fhe Val Ser Trp Ala Thr Leu leu Thr Leu Leu 

15 1 5 10 15 

gtg egg gac etg gee gag atg ggg age cca gac gec gcg geg gee gtg 95 

Val Arc Asp Leu Ala Glu Met Giy Ser Pre Asp Ala Ala Ala Ala Val 

2 C 2 5 3 j 

20 

cgc aag gac agg ctg cac ccg agg caa gtg aaa tta tta gag acc etc 143 

Arg Lys Asp Arg Leu His Fro Arg Gin Val Lys Leu Leu Glu Thr Leu 

35 40 45 

25 age gaa tac gaa ate gtg tet ccc ate cga gtg aac get etc gga gaa 151 
Ser Glu Tyr Glu lie Val Ser Pro lie Arg Val Asn Ala Leu Gly Giu 
5C 55 60 

ecc ttt ccc acg aac gtc :ac ttc aaa aga acc ega egg age att aac 233 
3 0 Fro Phe Pro Thr Asn Val His Fhe Lys Arg Thr Arg Arg Ser lie Asr. 

65 70 75 

tet gec act gac ccc tgg cet gec ttc gee tec tec tet tee tec tet 237 
Ser Ala Thr Asp Pro Trp Pre Ala Phe Ala Ser Ser Ser Ser Ser Ser 
3550 35 5 0 95 

acc tec tec eag gcg eat tae egc etc tet gec ttc qgc cag cag ttt 335 

Thr Ser Ser Gin Ala His Tyr Arg Leu Ser Ala Fhe Gly Gin Gin Fhe 

1C0 1 C 5 110 

40 

eta ttt aat etc ace gee aat gec gga ttt ate get cca ctg ttc act 333 

Leu Phe Asn Leu Thr Ala Asn Ala Gly Phe lie Ala Pre Leu Fhe Thr 

115 120 125 

45 gtc arc etc ctt ggg acg ::; ggg gtg sat eag act aag ttt tat ttc 4 3. 1 
Val Thr Leu Leu Gly Thr Pr? 31 y Val A sr. Glr. Thr Lys Fh<= Tyr Ser 
1 3 C 13: 1 4 C 

g.ia gag gaa geg gaa tta a^g ear tgt :tc tae aaa agg tta tgt caa 4 ~3 
53 7-\>i Glu Glu Ala e;- ; y-.. ; Lys Hi? "ys Fh* Tyr Lys Arg 1, e a ~y~ ~-lr. 
14 5 15 0 15 5 

tae caa etc ega gca cac gge egt eat eag cet etg etc agg aat gaa 327 
Tyr Gin Leu Arg Ala His Gly Arg His Gin Pre Leu Leu Arg Asn Glu 
5 5 160 16 5 " 17C 1"5 

cat aaa aat agg car act aaa gac aag sag aaa ace aga g-^a aga aaa 3 " 3 
His Lvs Asn Arc H:f Ser ' vs Asr Lys T.vs Lys Tnr Ar:: A 1 * A: u 
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210 215 220 

acs aga gaa aag agg a:: cac aga age aca aaa eg: ccc tea tec cac H9 
Thr Arg Glu Lys Arg Tr.r His Arg Arc Thr Lys Arg Fhe Leu Ser Tyr 
5 2 2 5 " 23 3 " 23 5 

cca egg ttt gta gaa gee ttg gcg gtg gca gac aac aga atg get tea 767 

Pro Arg Fhe Val Glu Val Leu Val Val Ala Asp Asn Arg Ket Val Ser 

24C 245 250 255 

10 

tac cat gga gaa aac cct caa cae tat att tta act tta atg tea ate 315 

Tyr His Gly Glu Asn Leu Gin His Tyr lie Leu Thr Leu .Yet £er lie 

260 265 270 

15 eta gee tec ate tat aaa gac cca age att gga aat tta att aat att 863 
Val Ala Ser lie Tyr Lys Asp Fro Ser He Gly Asn Leu He Asn He 
275 23C 235 

gtt att gtg aac tta att gtg att cat aat gac. cag gat ggg cct tee HI 
2C Val He Val Asn Leu He Val He His Asn Glu Gin Asp Gly Fro Ser 
290 2S5 30C 

ata tct ttt aat get cag aca aca tta aaa aac ttt tg : cag tog cag 959 
He Ser Fhe Asn Ala Gin Thr Thr Leu Lys Asn Phe Cys Gin Trp Gin 
25 3C5 310 315 

eat teg aac agt cca ggt gga ate cat cat gat act get gtt etc tta 1007 

His Ser Asn Ser Pre Gly Gly He His His Asp Thr Ala Val Leu Leu 

32 0 3 25 333 3 35 

3C 

aca aga cag gat ate tgc aga get cac gac aaa tgt gat acc tta ggc 1C55 

Tr.r Arg Gin Asp He Cys Arg Ala His Asp Lys Cys Asp Thr Leu 31 y 

34C 345 350 

35 ctg get gaa ctg gga acc att tgt gat ccc tat aga agr tgt tct att 1103 
Leu Ala Glu Leu Gly Thr He Cys Asp Fro Tyr Arg Ser Cys Ser He 
355 ' 360 365 

agt gaa gat agt gga ttg act aca get ttt acg ate get eat gag ctg 1151 

4 0 Ser Glu Asp Ser Gly Leu Ser Thr Ala Phe Thr He Ala His Glu Leu 

37C 375 36) 

ggc cat gtg ttt aac atg cct cat gat cac aac aac aaa tgt aaa gaa 1199 
Gly His Val Phe Asn Met Pro His Asp Asp Asn Asn Lys Cys Lys Glu 
45 3fE 393 395 

gaa gga gtt aag agt ccc cag cat gc c atg get cca ac.i ctg aac ttc 1 2 4 " 
Glu Gly VH Lys £er Pre Gin His Vgl Ke t Ala Frc Th. Leu As:. Fr.e 

n: c 4 c 5 4 : : 4:5 

5 3 

tac acc aac ccc tgg atg tgg tea aag tgt agt cga aaa tat ate act 12 

Tyr Thr Asn Frc Trp Ket Trp Ser Lys Cys Ser Arg Lys Tyr He Thr 
42 0 42 5 4 3 0 

55 gag ttt tta gac at: ggt tat ggc gag tgt ttg cct aac gaa cct gaa 1343 
Glu ?.-.e Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asn Glu Frc Glu 
4H 4 4:> 44: 
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cca tat ate atg cag tec aga egg etc tgg tgc aa: as: g:c aa: gga 1 4 8 ' 
Pro Tyr Xec Met Gin Cys Arg Arg Leu Trp Cys Asn Asr. Val Asr. Gly 
4EC 4 8 E 4 90 4 95 



5 gta cac aaa gge tgc egg act cag cac aca ccc egg gec gat ggg ac: 
Val His Lys Gly Cys Arg Thr 31n His Thr Pre Trp Ala Asp Gly Thr 
ECC 505 510 



153: 



gag tgc gag ;et gga aag cac tgc aag nat gaa ttt tgt gtt ccc aaa 15E3 

10 Glu Cys Glu Pro Gly Lys His Cys Lys Xaa Gly ?he Cys Val Pro Lys 

515 52C 525 

gaa arc gat gtc ccc gtg aca gat gga tec tgg gga agt egg act ccc 1631 

Glu Met Asp Val Fro Val Thr Asp Gly Ser Trp Gly Ser Trp Ser ?rz> 

15 530 535 540 

ttt gga acc tgc tec aga aca tgt gga ggg ggc ate aaa aca gee att l£~9 

Phe Gly Thr Cys Ser Arg Thr Cys Gly Gly Gly lie Lys Thr Ala He 

545 550 555 

20 

cga gag tgc aac aga cca gaa cca aaa aat ggt gga aaa tac tgt gta 1727 

Arq Glu Cys Asr. Arg Fro Glu Pre Lys Asn Gly Gly Lys Tyr Cys Val 

56C 565 570 575 

25 gga cgt aga arg aaa ttt aag tee tgc aac aeg cag cca eat etc aag 1775 

Gly Arg Arg Yet Lys Phe Lys Ser Cys Asn Thr Glu Pro Cys Leu Lys 

580 58 5 5SD 

cag aag cga gac ttc cga gat. gaa cag tgt get cac ttt gac ggg aag 1523 

3 0 Gin Lys Arg Asp Phe Arg Asp 31u Glr. Cys Ala His Phe Asp C-ly Lys 

"5 95 SZC 60 5 

cat ttt aac ate aac ggt ctg ctt ccc aat gtg egc tgg gtc cot aaa 1671 

His Phe Asn lie Asn Gly Leu Leu Fro Asn Val Arg Trp Val Pro Lys 

3 5 610 615 620 

tae agt gga att ctg atg aag gac egg tgc aag ctg ttc tgc aga gtg 1515 

Tyr Ser Gly He Leu Met Lys Asp Arg Cys Lys Leu Phe Cys Arg Val 

625 63C 635 

4 0 

oca ggg aac aca gee tac tat cag ctt cga gac aga gtg ata gat gga 1S67 

Ala GH* Asn Thr Ala Tyr Tyr Hn Leu Arg Asp Arg Val He Asp Gly 

640 ' 645 650 655 

45 att cct tgt gge cag gac aca aat gr. ate tgt gte cag gge ctt tg; 2 0 1 5 

Thr ?r: Cys y Gin Asp Tnr Asn Asr. He Cys Val Gin Gly Leu Cys 

esc 5-e 6c: 

egg ea = get gga tgc gat eat gtt tta aac tea aaa gee egg a-jj gat H=- 

v At"; 31 t Ala 31 y Cys A;[. His- VhI L e l. Asr 5-rr I.ys A 1 a Arg Arg A = r 

i-'s 6 = C £5 5 

aaa tgt ggg tgt ggt gge gat aat tot tea tgc aaa aca gtg gca 2111 

Lys Cys Gly Val Cys Gly Gly Asp Asn Ser Ser Cys Lys Thr Val Ala 

5 5 * * o9C 6 55 7;;: 

a:a aca ttt aat aca g t a cat tat cgt tae aat set gtg gtc e 3 a att H 5 9 

G", v "Thr Ph~ Asn Thr V-:l H : s Tvr G". y Tvr Asr. Thr V=s 1 Va 1 Av: 1 1 r- 



: : :• 



A. . Asr. 



... 
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-4C 7^5 7SC 

ttg eta aat gga aa: tct gtt =t: aca stg gec aaa agg gaa at: 

Fhe Leu Leu Asn Glv Asn Phe Val Val Thr Met Ala Lys Arg Glu lis 

5 ?55 76C 765 



gec cag tgt ggc ttg ggc tac cgc aca ttg can ate tac tgt gec aaa 
4 0 Ala Gin Cys Gly Leu C-ly Tyr Arg Thr Leu Asp lie Tyr Cys Ala Lys 
SCO 9 05 SIS 



2 :■ c : 



23 El 



cgc at- ggg aac get gcg gta gag tac age ggg tec gag act gec gta 

Arg lie Gly Asn Ala Val Val Glu Tyr Ser Gly Ser Glu Thr Ala Val 

770 775 760 

10 

gaa aga att aac tea aca gat cgc att gag caa gaa ctt ttg ctt cag 23S9 

Glu Arg He Aen Ser Thr Asp Arg lie Glu Gin Glu Leu Leu Leu Gin 

785 750 7SE 

IE gtt ttg teg gtg gga aag ttg tac aac :cc gat gta cgc tat tct tec 2447 

Val Leu Ser Val Gly Lys Leu Tyr Asn Fro Asp Val Arg Tyr Ser Phe 

QjZ J 835 * 510 815 

rr? ?.tt gaa gac aaa cct cag cag ttt tac tgg aac agt cat 2133 

20 Asn He Fro He Glu Asp Lys Pro Gin Gin Phe Tyr Trp Asn Ser His 

620 825 :2C 

ggc cca tgg caa gca tgc agt aaa ccc cgc caa ggg gaa egg aaa cga 2542 

Glv Fro Trp Gin Ala Cys Ser Lys Pro :ys Gin Gly Glu Arg Lys Arg 

25 635 840 645 

aaa ctt gtt tgc acc agg gaa tct gat cag ctt act gtt tct gat caa 

Lys Leu Val Cys Thr Arg Glu Ser Asp Gin Leu Thr Val Ser Asp Gin 

850 S55 S6C 

3 0 

aga tgc gat egg ctg ccc cag cct gga cac att act gaa ccc tgt ggt 

Arg Cys Asp Arg Leu Pro Gin Pre Gly His He Thr Glu Pro Cys Gly 

^355 6 70 875 

35 aca ggc tgt gac ctg agg tgg cat gtt gec age agg agt gaa tgt act 266" 

Thr Gly Cys Asp Leu Arg Trp His Val Ala Ser Arg Ser Glu -Cys Ser 

8 6 0 * 8 65 8 90 89 5 



2591 



273 5 



tat age agg ctg gat ggg aag act gag aag gtt gat gat ggt ttt tgc 2763 

Tyr Ser Arg Leu Asp Gly Lys Thr Glu Lys Val Asp Asp Gly Phe Cys 

45 915 92: ?lo 

age age cat ccc aaa cca age aac eg: gaa aaa tg: tea ggg gaa tgt 1-1 '. 

Ser Sor His Pre Ly = t ro Ser Asn Arg Glu Lys Cys Ser Gly G- J Cys 

331 ' 3H 34C 

50 

aac acg ggt ggc tgg cgc tat tct gec tgg act gaa tgt tea aaa age 

Asn Thr Gly Gly Trp Arg Tyr Ser Ala Trp Thr Glu Cys Ser Lys Ser 

345 ' ' 350 555 

55 tgt gac ggt ggg ace cag agg aga agg get att tgt gtc aat acc cga 

Gys Asp Gly Gly Thr Gin Arg Arg Arg Ala lie Cys Val Asn Thr Arg 
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-33 tea gag tgc ctg gec ac: tgc 99a aaa ggg cat aag cac age cag 3071 

Trc Ser Glu Cys Leu Val Thr Cys Gly Lys Gly His Lys His Ser Glr. 

:01c ' 1015 1:2: 

5 gtc tgg tgt rag ttt ggt gaa gat cca tta aat cat aja atg tgc gac 311? 

Val Trp Cys Gin ?he Gly Glu Asp Arg Leu Asn Asp Arg Met Cys Asp 

1025 1030 1035 

cc: gag acc aag cca aca tec atg cag act tgt cag cag ccg gaa tgt 315? 

1C Frc Glu Thr Lys Pre Thr Ser Met Gin Thr Cys Gin Gin Pro Glu Cys 
1040 1045 1050 1055 



gca tec egg cag gcg ggt ccc tgc gta cag tgc agt gtc act tgt gga 

Ala Ser Trp Gin Ala Gly Pro Trp Val Gin Cys Ser Val Thr Cys Gly 

15 1060 1065 1070 

cag gga tac cag eta aga gca gtg aaa tgc a:: att ggg act tat atg 

Gin Gly Tyr Gin Leu Arg Ala Val Lys Cys lie lie 3ly Thr Tyr Xer 

2C?5 1 C S 0 .0^ 

2 0 

tea gtg gta gat gac aat gac tgt aat gca gca act aca cca act gat 

Ser Val Val Asp Asp Asn Asp Cys Asn Ala Ala Thr Arg Pro Thr Asp 

109C " 1095 11C0 

25 acc cag gac tgt 33a tta cca tea tgt cat cc: ccc era get gec ccg 

Thr Gin Asp Cys Glu Leu Pro Ser Cys His Pre Pre Pro Ala Ala Pre 

1105 1110 1115 

gaa acg agg aga age aca tac agt gca cca aga acc cag tag cga ttt 

3 0 Glu Thr Arg Arg Ser Thr Tyr Ser Ala Fro Arg Thr 31n Trp Arg Phe 

1120 " 112 5 1130 1135 



3215 



3 2C3 



3311 



335S 



34 3- 



ggg tct egg acc cca tgc tea gec act tgt ggg aaa jgt acc egg atg 3455 

Gly Ser Trp Thr Pro Cys Ser Ala Thr Cys Gly Lys 2ly Thr Arg Met 

35 1140 1145 1150 

aga tac gtc age tgc cga gat gag aat ggc tct gtg get gac gag agt 3503 

Arg Tyr Val Ser Cys Arg Asp Glu Asn Gly Ser Val Ala Asp Glu Ser 

1155 1160 1165 

4 0 

gec tgt get acc ctg cct aga cca gtg gca aag gaa gaa tgt tct gtg 3551 

Ala Cys Ala Thr Leu Pre Arg Pro Val Ala Lys Glu 31u Cys Ser Val 

117C 1175 HeD 

4 5 aca ccc tgt ggg caa tgc aag gee teg gac tgg age trt tgc tct gtg 35?? 

Thr Pre 3y£ 31 y Glr. Trp lys Ala Leu Asp Trp Ser Ser Cys Ser Val 

1155 ' ' 115: * ~11*5 

-it: tgt gg ggt agg gca arc egg caa gtg atg tgt gtc a«c tar 3 -,4 7 

5C Thr Cv£7 Glv Gin 7-ly Arg A>. Thr Arg 7-1 r. V". I Met C . ■= Val Asr. jyr 

12?: ' ' 1205 121C 1215 

agt gac cac gtg ate gat egg agt gag tgt gac cag gat tat ate cca 35^5 

Ser Asp His Val He Asp Arg Ser Glu Cys Asp Gin Asp Tyr He Frc 

55 1220 1225 12 3 0 

^aa act c*-; eag gac tgt tee atg tea ;:a tgc ret caa ag; arc ecu J'4l 

Glu Thr Asp Gin Asr Cys Ser V?t Ser Pre "vs 7r- ~\r Arc Thr • 
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:2c: 12-s 

aga ac: ggc ccc tgg gga gea cgt tec agt acc tgt get ggc gga :cc 

Arg Thr Gly Pre Trp Sly Ala Cys Ser 3er Thr Cys Ala Gly Gly 3er 

5 1280 *12B5 12 90 2295 

cag egg cgt gtt gtt gca tgt cag gat gaa aa: gga Lac a:: gca aac 

Gin Arg Arc: val Vai Val Cys Gin Asp Glu Asn Gly Tyr Thr Ala Asn 

13 00 13 05 1310 

10 

gac tgt gtg gag aga ata aaa cct gat gag caa aga gec tgt gaa tec 

Asp Cys Val Glu Arg He Lye Pre Asp Glu Gin Arg Ala Cys Glu Ser 

1315 " ' 1320 1325 

15 gge cct tgt ect cag tgg get tat ggc aac tgg gga gag tgc act aag 

Gly Pro Cys Fro Gin Trp Ala Tyr Gly Asn Trp Gly Glu Cys Thr Lys 
133-0 13 3 5 134C 

ctg tgt ggt gga gge ata aga aea aga ctg gtg gte Let cag egg tec 

2C Leu Cys Gly Gly Gly lie Arg Thr Arg Leu Val Val Ser Gin Arg Ser 

1345 " 135C 1355 

aac ggt gaa egg ttt cea gat ttg age tgt gaa ate etc gat aaa cct 

Asn Gly Glu Arg Phe Pro Asp Leu Ser Cys Glu lie Leu Asp Lys Pre 

25 1350 13 6 5 137 0 1375 



35 cat aaa caa :ga aat gtt tac tgc atg gca aaa gat gga age cat tta 

His Lys Gin Arg Asn Val Tyr Cys Met Ala Lys Asp Gly Ser His Leu 

1410 " 1415 1420 

gaa agt gat tac tgt aag eac ctg get aag cca eat ggg cac aga aag 

4 0 Glu Scr Asp Tyr Cys L/s His Leu Ala Lys Pro Kis Gly His Arg Lys 

1425 ' * 1430 1435 

tgc cga gga gga aga tgc ccc aaa tgg aaa get ggc get tgg agt cag 

Cys Arg Gly Gly Arg Cys Pro Lys Trp Lys Ala Gly Ala Trp Ser Gin 

4 5 1440 144: 14 5 C 14 5 5 

tgc tct gtg tee atg gge cga ggc gta cag cag agg cat gtg ggc tgt 

Cvs ier Val l-jr M e -_ 3ly Aig Gly Val Gin Jin Arg His Val Gly Cye 

14- C 14 £5 14"2 

r ~, 

cag ate gga aca cac aaa ata gee aga gag acc gag tgc aac cea tac 

Gin lie Gly Thr His Lys lie Ala Arg Glu Thr Glu Cys Asn Pre Tyr 

*14 7 5 146 : 14 3 5 

55 ace aga eeg gag teg gaa tgc gaa tgc caa ggc cca egg tgc eee ctt 

Tnr Arc Pro Glu Ser 31 u Cys Glu Cys Gin Cly Pre Arg lys Pre leu 

~14=m: 14 55 15 CO 



: 85" 



3S35 



3 563 



4C31 



412" 



41^5 



ccc gat cgt gag cag tgt aac aca cat get tgt cca cac gac get gca 

Pro Asp Arg Glu Gin Cys Asn Thr His Ala Cys Pro His Asp Ala Ala 

1380 ' 1335 1390 

30 

tgg agt act ggc cct tgg age teg tgt tct gtc tec cgt cgt cga ggc 4223 

Trp Ser Thr Gly Pro Trp Ser Ser Cys Ser Val Ser Cys Gly Arg Gly 

13S5 * 14C0 1405 



4 271 



4 :■■ 1 3 



4 36^ 



4 4 1 



4 511 
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gag gtg ca: egg gca cgc tgc gac gtg age aag egg ccg gtg gar cgt 46s:- 

Glu Val His Glv Ala Arg Cys Asp Va; Ssr Lys Arg Pre Val Asp Arg 

154C 1545 1550 

5 gaa age tgc agt ctg :aa ::c cgc gag tae etc tgg act aca gga gaa 47C3 

Glu 3er Cys Ser Leu Glr. Pro Cys Glu Tyr Val Trp Thr Thr Gly Glu 

1555 156C 1565 

egg :ca gag tgc tea gtg acc Cgt gga aaa ggc tac aaa caa agg ctt 

10 Trp Ser Glu Cya Ser Val Thr Cys Gly Lys Gly Tyr Lys Gin Arg Leu 

1570 ' 1575 1560 

gtc teg tgc age gag att tac acc ggg aaa gag aa: tat gaa tac age 

Val Ser Zys Ser Glu He Tyr Thr 31y Lys Glu Asn Tyr Glu Tyr Ser 

15 1555 1590 1555 

tac caa acc acc ate aac cgc cca ggc acg cag ccc ccc agt gtt cac 4S4~ 

Tyr Gin Thr Thr He Asn Cys Fro :-ly Thr Gin Fro Pre Ser Val His 

16 CO 1SC5 1C13 1£15 

20 

ccc tgt tac ctg agg gag cgc ccc gtc teg gec acc tgg aga gtt ggc 

Pre Cys Tyr Leu Arg Glu Cys Pro Val Ser Ala Thr Trp Arg Val Gly 

1620 1525 1530 



ret gtg caa tgt tta acc 5at gag 55c caa ccc age cac tea tgc cac 

3G Ser Val Gin Cys Leu Thr Asn Glu Asp Gin Fro Ser His Leu Cys His 
155: 1655 16 6 0 

act gat ctg aag cca gaa gaa cga aaa ac: tgc cgc aat gtc tat aac 

Thr Asp Leu Lys Fro Glu 31u Arg Lys Thr Cys Arg Asn Val Tyr Asn 
35 1665 1*70 1675 

tgt gag eta ccc cag aat cgc aag gag gta aaa aga ctt aaa ggt ccc 

Cys Glu Leu Frc Gin Asn Zys Lys 31u Val Lys Arg Leu Lys Gly Ala 

1S3D 1665 169C 1635 

4C 

age gaa gat ggt gaa tat etc ctg atg att aga gga aag ctt ctg aag 

Ser Glu Asp Gly Glu Tyr ?he Leu Met: He Arg Gly Lys Leu Leu Lys 
1700 1705 171C 

4 c aea etc eg: gee ggg atg cac cct gac ca c ccc aaa gag eac gtg aca 

He ?he Cy? A 1 a Gly Met His Ser Asp H: 5 Pre Lys Glu Tyr Val Thr 

1 1 1 5 7 : 0 17 2 5 

ctg gtg cue gga gar ect gag ar. etc tec gag gtt tae ggg e-c a gg 

51 Leu Val Hi? Gly Asp Ser Glu Asn Fee Ser Glu Val Tyr Gly His Arg 
17;c -735 174; 

tta cac aac cca aca gaa tgt ccc tat aac ggg age egg cgc gat gac 

Leu His Asn Fro Thr Glu Cys Pro Tyr Asn Gly Ser Arg Arg Asp Asp 

5 5 17 4 5 17 5 C "5 3 

tgc caa cgt egg aag gat tac acg gec get ggg ttt tee ugt tee cag 

-It ~\\: -\r : Acc ~v: All A1-: v :• -■■r ••: 7-he "In 



4751 



4 79 9 



489^ 



25 aac tgg ggg age tgc tea gtg tee tgt gge get gga ctg atg cag aga 4543 
Asn Trp Gly Ser Cys Ser Val Ser Cys Gly Val Gly Val Met Gin Arg 
1635 164C 1645 



4 9 91 



5039 



50S7 



5135 



52 "9 
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1755 1SCC ISCr 

999 gai tgc -ar age get cc: aag tgc c:a cag ggt cgt ::: age ate 54 7 : 
Sly Asp Cys Tyr Ser Ala Ala L/s Cys Frc Gin Gly Arg Phe Ser He 
5 1810 IB 15 162 C 

aa: c:t tat gga acc ggc ttg tct tta act gaa tct gee aga tgg ata 5519 

Asn Leu Tyr Gly Thr Gly Leu Ser Leu Thr Glu Ser Ala Arg Trp He 
1625 1330 1835 

1C 

tea caa ggg aat tat get gtc tct gac ate aag aag teg ccg gat ggt 5567 

Ser Gin Gly Asn Tyr Ala val Ser Asp He Lys Lys Ser Pro Asp Gly 
1640 1345 1E50 1855 

15 ace ccq are gta ggg aaa tgc ggt ggt tac tgt gga aaa tge aet cca 5615 
Thr Arg Val Val Gly Lys Cys Gly Gly Tyr Cys Gly Lys Cys Thr Pro 
166C 1865 1670 

:cc lcl ggt act ggc ccq gag gyy uga yet tta nagccaaggt gecrzga Q gj 5tt5 
2C Ser Ser Gly Thr Gly Leu Glu Val Arg Val Leu 
1675 * 186C 

egaagecatt atggatggat gaaggatagt aatgeaatae ctccacctta atttgggtgc 5725 

25 atgtgtatgt gtgtgtgtgt ttgtgtgtga ettgtatget tgtgtgtgta aatgtg^gta 576B 

catatacata tataca 56C4 



30 <.21C> 13 

<211> 1682 
c212> ?RT 

<213> Herr.o sapiens ALAXTS-S 

3 5 <40 0> 13 

Ser Thr Xet Gin Phe Val Ser Trp Ala Thr Leu Leu Thr Leu Leu Val 
15 10 15 

Arc: Asp Leu Ala Glu Met Gly Ser Pro Asp Ala Ala Ala Ala Val Arg 

4C 20 25 3C 

Lys Asp Arg Leu Kis Frc Arg Gin Val Lys Leu Leu Glu Thr Leu Ser 
35 40 45 

4 5 Glu Tyr Glu lie Val Ser Pro He Arg Vil Asn Ala Leu Sly Glu Pre 

i ; 51 t : 

Phe ?r ; Tr.r Asr. Vol Hi a P:ie Lys Arg ll'ir Arj A: j He A;-r. ;t: 

£5 7C ~5 cZ 

r ' 

Ala Tr.r Asp Pre Trr Frc Ala Phe Ala Ser ser Ser Ser Ser Ser Thr 

c5 9C 55 

Ser Ser Gin Ala His Tyr Arg Leu Ser Ala Phe Gly Gin Sin Fhe :.eu 

5 5 ICC ICE li: 

Phe Asr. Leu Thr Ala Asr. Ala Gly Phe He A In Pre Leu Phe Thr Val 
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It 5 



175 



Lys Asr. Arc His Ser Lys Asp Lys Lys Lys 7r.r Arg Ala Arg Lys Trp 



: so 



135 



1?0 



Gly Glu Arc He Asn Leu Ala Gly Asp Val Ala Ala Leu Asn Ser Gly 



195 



20C 



i 05 



Leu Ala Thr Glu Ala Phe Ser Ala Tyr Gly Asr. Lys Thr Asp Asn Thr 
10 21C 215 220 

Arg Glu Lys Arg Thr His Arg Arg Thr Lys Arc Phe Leu Ser Tyr Pro 



125 



23 C 



23 5 



24Q 



15 Arg Phe Val Glu Val Leu Val Val Ala Asp Asr. Arg Met Val Ser Tyr 



24 = 



250 



2 55 



His Gly Glu Asn Leu Glr. His Tyr He Leu Thr Leu Met Ser He Val 



27Q 



20 



Ala Ser He Tyr Lys Asp Fro Ser He Gly Asn Leu He Asn He Val 



275 



260 



265 



He Val Ash Leu He Val He His Asn Glu Glr. Asp Gly Pro Ser He 
2&C 295 300 

Ser Phe Asr. Ala Glr. Thr Thr Leu Lys Asn Phe Cys Gin Trp Gin His 



3 05 



310 



315 



32 C 



3 0 Ser Asn Ser Pro Gly Gly He His His Asp Thr Ala Val Leu Leu Thr 



325 



330 



335 



Arg Gin Asp He Cys Arg Ala His Asp Lys Cys Asp Thr Leu Gly Leu 



340 



345 



350 



35 



Ala Glu Leu Gly Thr He Cys Asp Pre Tyr Arg Ser Cys Ser He Ser 



3 55 



360 



365 



Glu Asp Ser Gly Leu Ser Thr Ala Phe Thr He Ala His Glu Leu Gly 
4 0 3 7C 3 75 3 80 

His Val Phe Asn Met Pro His Asp Asp Asr. Asn Lys Cys Lys Glu Glu 



3 65 

Gly Val Lys iier 



3 90 



395 



4 0C 



:ir. His Val Met Ala Pro Thr Leu Asn The Tyr 
4 1 : 4 15 



Thr Asn P:o Trp Met Trp Ser Ly_ lys Ser Arg Lys Tyr He .r.r Glu 



425 



43 : 



re Leu Asp Tnr Gly Tyr Gly Glu Cys Leu Leu Asn Glu rro Glu Ser 



4 ;5 



44 ; 



445 



Arg Pro Tyr Pro Leu Pro Val Gin Leu Pro Gly He Leu . yr Asn Val 
4 5 C 455 4£C 



Asn Lys Gin xaa Glu Leu He l-ne Gly Prr Gly Ser 

.; £ - ". ~ r 



Gin Val Cys Pre 
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515 520 525 

Y.er Asp Val Pro Val Tr.r Asp Gly Ser Trp Gly Ser Trp Ser Pro Fhe 

5 3 0 5 j 5 5 4 0 

5 

01 y Thr Cys Ser Arc Thr Cys Gly Gly 3Iy He Lys Thr Ala He Arg 

545 550 555 550 

Glu Cys Asn Arg Fro Glu Pro lys Asn Gly Gly Lys Tyr Cys Val Gly 
10 565 57C 575 

Arg Arg Met Lys Phe Lys Ser Cys Asn Thr Glu Pro Cys Leu Lys Gin 
56C 5=5 5 90 

15 Lys Arg Asp Phe Arg Asp Glu Gin Cys Ala His Phe Asp Gly Lys His 

5 95 6CC 5 05 

Phe Asr. He Asn Gly Leu Leu Fro Asn Val Arg Trp Val Fro Lys Tyr 
513 615 62 0 

20 

Ser Gly He Leu Met Lys Asp Arg Cys Lys Leu Phe Cys Arg Val Ala 
625 630 535 540 

Gly Asr. Tr.r Ala Tyr Tyr Gin Leu Arg Asp Arg Val He Asp Gly Thr 
25 545 550 655 

Pro Cys Gly Gin Asp Thr Asn Asp He Cys Val Gin Gly Leu Cys Arg 
660 665 670 

3 0 Gin Ala Gly Cys Asp His Val Leu Asn Ser Lys Ala Arg Arg Asp Lys 

675 680 €85 

Cys Gly Val Cys Gly Gly Asp Asn Ser Ser Cys Lys Thr Val Ala Gly 
690 655 7C0 

35 

Thr Phe Asn Thr Val His Tyr Gly Tyr Asn Thr Val Val Arg Tie Pro 
705 710 715 720 

Ala Gly Ala Thr Asn He Asp Val Arc Gin His Ser Phe Ser Gly Glu 
40 725 HO 735 

Thr Asp Asp Asp Asn Tyr Leu Ala Leu Ser Ser Ser Lys Gly Glu Phe 
740 745 750 

4 5 Leu Leu Asn Gly Asn Phe Val '.'si Thr Vet Ala Lys Arg Glu lie Arc 

"55 HO 755 

He Gly Asr. AH Val Val Hu Tyr Ser Hy ?<:-r Olu Thr Ala Vai GH 

~ 7 : - -; 5 ~ z J 

r. - 

Arg He- As:. H- r i'.-.r A sp Arg He Hu 3 1 n Gl - Leu Le_ G . r. V.s- 

t&5 h: "95 =:: 

Le- Ser Vsl Gly Lys Leu Tyr Asn Pre Asp Val Arg Tyr Ser Phe Asn 
55 £05 HO £15 

He Prr He 3h Asp lys Prt Gin jIt. 7 h-t Tyr Trp A=r. Ser His 7! y 

a 2 o H5 5 :■ : 
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56 5 EH e -7 - 8H 

Giv Cvs Asc Leu Ar- Trp His Val Ala Ser Arg Ser Glu Cys £er Ala 
3 8= 3?C 555 

5 

Gin Cys Gly Leu 31/ Tyr Arg Thr Leu Asp He Tyr Cys Ala Lys Tyr 
9CC 90 E 910 

Ser Arg Leu Asp Gl/ Lys Thr Glu Lys Val Asp Asp Gly Fhe Cys Ser 
10 " 515 920 925 

Ser His Pre Lys Fro Ser Asn Arg Glu Lys Cys Ser Gly Glu Cys Asr. 
933 S 3 5 94C 

15 Thr Gly Gly Trp Ar-3 Tyr Ser Ala Trp Thr Glu Cys Ser Lys Ser Cys 
545 ' S5C 555 9c*C 

Asp Gly Gly Thr Gin Arg Arg Arg Ala He Cys Val Asn Thr Arg Asn 

565 570 575 

20 

Asp Val Leu Asp Asp Ser Lys Cys Thr His Gin Glu Lys Val Thr He 
98C S 6 5 590 

Gin Arc Cys Ser Glu Phe Pro Cys Pre Gin Trp Lys Ser Gly Asp Trp 
25 " 995 10C 3 1005 

Ser Glu Cys Leu Val Thr Cys Gly Lys Gly His Lys His Ser 31 r. Val 
1 C 1 C 1015 102 0 

3 0 Trp Cys Gin Phe Gly Glu Asp Arg Leu Asn Asp Arg Met Cys As? Pre 
1025 103C 1035 1C4C 

Glu Thr Lys Pro Thr Ser Ket Gin Thr Cys C-ln Gin Pro Glu Cys Ala 
1C45 1050 1C55 

35 

Ser Trp Gin Ala Gly Fro Trp Val Gin Cys Ser Val Thr Cys Gly Gin 
1C6C * * H65 1070 

Gly Tyr Gin Leu Arg Ala Val Lys Cys He He Gly Thr Tyr y.ez Ser 

4C :o "5 mac i:s5 

Val Val Asp Asp Aen Asp Cys Asn Ala Ala Thr Arg Pre Thr Asp Thr 
1 C 9 C 10 9 5 1 1 C D 

45 Gin Asp Cys Glu Le Pre Ser Cys His Pre Pro Pre Ala Ala Fro Glu 

iioi " ' nic hie nz: 

Chi A: Ara Ser Thr Tyr Ser Ala He A: g Tnr Gl:; Trp Arg He Hy 

n :f in: 1H: 

Ser Trp Thr Pre Cys Ser Ala Thr Cys Gly Lys Gly Thr Arg Mei Arg 

in:- H45 in: 

Tyr Val Ser Cys Arg Asp Glu Asn Gly Ser Val Ala Asp Glu Ser Ala 
55 11 55 ' 116 0 116 5 

Cys Ala Tnr Leu He Arg Pre Val AH Lys Glu Glu Cys Ser Val Cm 

' — - - : : - = : : n 
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122C 1225 :230 

Thr Asp Glr. Asp Cys Ser >'.e: Ser Pro Cys i : -r: Glr. Arg Thr rre Asp 
12 3 5 12 4 0 12 4: 

5 

Ser Gly Leu Ala Glr. His Pre the Gin Asr. Glu Asp Tyr Arg Pro Arg 
1251 1255 1260 

Ser Ala Ser PrD Ser Arg Thr His Val Leu Gly Gly Asn Gin Trp Arc 
10 1265 1270 1275 128C 

Thr Gly Pro Trp Gly Ala Cys Ser Ser Thr Cys Ala Gly Gly Ser Glr. 

1265 1290 *1295 

15 Arg Arg Val Val Val Cys Gin Asp Glu Asn Gly Tyr Thr Ala Asn Asp 
13CC 1305 1310 

Cys Val Glu Arg He Lys Pre Asp Glu Glr. Arg Ala Cys Glu Ser Gly 
1313 1320 1325 

20 

Fro Cys Fro Glr. Trp Ala Tyr Gly Asn Trp Gly Glu Cys Thr Lys Leu 
1330 1335 1340 

Cys Gly Gly Gly He Arg Thr Arc Leu Val Val Ser Gin Arg Ser Asn 
25 1345 1250 1355 1360 

Gly Glu Arg Phe Pre Asp Leu Ser Cys Glu lie Leu Asp Lys Pro Pre 
1365 137C 1375 

3C Asp Arg Glu Gin Cys Asn Thr His Ala Cys Pro His Asp Ala Ala Trp 
1330 13S? 1390 

Ser Thr Gly Pre Trp Ser Ser Cys Ser Val Ser Cys Gly Arg Gly His 
1395 1400 14C5 

35 

Lys Gin Arg Asn Val Tyr Cys Met Ala Lys Asp Gly Ser His Leu Glu 
1410 1415 1420 

Ser Asp Tyr Cys Lys His Leu Ala Lys Fro His Gly His Arg Lys Cys 
40 1425 1430 1435 1440 

Arg Gly Gly Arg Cys Pro Lys Trp Lys Ala Gly Ala Trp Ser Gin Cys 
1445 1450 1455 

4 5 Ser Val Ser .Vet. Gly Arg Gly Val Glr. Glr Arg H : s Val Gly Cys Glr. 

1461 14-: 14^C 

11- Gly Thr i!;s Lys l.e Ala A:u Glu Thr Glu Cys Air Pre Ty; Thr 
1 4 " E 14 = : 14 = 3 

Arg Pre Glu Ser Glu Cys Glu Cys Glr. Gly Pre Arg Cys Pi: Leu Tyr 
1490 1495 15 0 1 

Thr Trp Arg Ala Glu Glu Trp Glr. Glu Cys Thr Lys Thr Cys Gly Glu 
55 15 0 5 1510 1515 152: 

Gly Se: Arg Tyr Arg Lys VesI Val lys Val Asp Asp Asn Lys Asr 31_ 

- 7. r ■ 7 ~ ~ z -i c 
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1 5 7 C - 5 E 15 3 0 

Ser Cys Ser Glu lie Tyr Thr Sly Lys Glu Asr. Tyr Glu Tyr Ser Tyr 
1535 15SC 1595 HCC 

5 

Glr. Thr Thr He Asn Cys Pro 31 y Thr Gin Fro Fro Ser Val His Pro 
1605 1510 1615 

Cys Tyr Leu Arg Glu Cys Pro Val Ser Ala Thr Trp Arg Val Gly Asr. 
10 " 1620 1625 163C 

Trp Gly Ser Cys Ser Val Ser Cys Gly Val Gly Val Ket Glr. Arg Ser 
1635 1640 1645 

15 Val Glr. Cys Leu Tr.r Asn Glu Asp Gin Fro :ier His Leu Cys His Thr 
16 5 C 16 5 5 1660 

Asp Leu Lys Pre Glu Glu Arg Lys Thr Cys Arc Asn Val Tyr Asn Cys 
16*6 5 1670 1 b 7 5 16cC 

20 

Glu Leu Pre Gin Asn Cys Lys Glu Val Lys Arg Leu Lys Gly Ala Ser 
1685 ' 159C 1695 

Glu Asp Gly Glu Tyr Phe Leu Met He Arg Gly Lys Leu Leu Lys He 
25 1700 17CS 1710 

Phe Cys Ala Gly Met His Ser Asp His Fro Lys Glu Tyr Val Tr.r Leu 
1715 1720 1725 

3 0 Val His Gly Asp Ser Glu Asn Phe Ser Glu Val Tyr Gly His Arg Leu 

173 0 " 173 5 174 0 

His Asn Fro Thr Glu Cys Fro Tyr Asn Gly Ser Arg Arg Asp Asp Cys 
1745 1750 1755 1760 

35 

Glr. Cys Arg Lys Asp Tyr Thr Ala Ala Gly Phe Ser Ser Phe Gin Lys 
1765 1770 1775 

He Arg He Asp Leu Thr Ser Ket Gin He He Thr Thr Asp Leu Gin 
40 173C 1755 179 0 

Fhe Ala Arg Thr Ser Glu Gly His Pro Val Pro phe Ala Thr Ala Gly 
17 95 15 0 0 1 S C 5 

4 r Asr. Cys Tyr Ser Aid Ala Lys Cys : : .. Glr Gly Arj ?hc Ser He Asn 

lalC IS 15 lc2 ; 

L>_-l Tyr G 1 y T.:.r Gly L-u i -:• r ue.. Tr.r ~H . : ■:■ r A 1 ?. Ar. Try He Ser 

ic2r :e:: 1635 ib-i : 

:. 

Gin Gly Asn Tyr Ala Val Ser Asp He Lys Lys Ser Pre Asp Gly Tr.r 
1S4 5 155: H55 

Arc Val Val Gly Lys Cys Gly Gly Tyr Cys Gly Cys Cys Thr Fro Ser 

5 5 1 : 6 C :=:': 1 ? ~* ! 

Ser Gly Tr.r Gly Leu Glu Val Arg Val Leu 



wo hi i ni-4 



42 



PC T I SOU -2 1 22 J 



<22C > 

<2::> ccs 
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<4CC> 14 

g cac act geg g:c ate ag: ctg tgc tec gga a:a atg ggc acg ::c cge 43- 

His Tnr Ala Val lie Ser Leu Cys Ser Gly Met Met Gly Thr Phe Arg 

1 5 10 15 



10 tct cac gat gga gat cat ttc att gaa cca ctg cag tct gtg gat gag 97 
Ser His Asp Gly Asp Tyr Phe He Glu Fro Leu Gin Ser Val Asp Glu 

20 25 30 



caa gag gat gaa gag gaa caa aac aaa ccc :ac att att tat agg cac 145 
15 31 r. Glu Asp Gla Glu Glu 31n Asn Lys Fro His lie lie Tyr Arg His 
3 5 4 0 4 5 



age acc cct cag agg gaa ccc tec aca gga aag cat gec tgt gee acc 193 
ser rnr tro Gin Arg Glu Fro Ser Thr Gly Lys His Ala Cys Ala Thr 
2 0 50 " 55 6C 



tea gaa etc aaa aat ag: rac act aaa gac aag egg aaa ate aga atg 24 1 

Ser Glu Leu Lys Asn Ser His Ser Lys Asp Lys Arg Lys He Arg Met 

6 5 70 7 5 SO 

cga aaa egg aga aag agg aat age ctg get gac gac gtg gca ctg eta 269 

Arg Lys Arc Arg Lys Arg Asn Ser Leu Ala Asp Asp Val Ala Leu Leu 
;=5 " 90 55 



3 0 aag age ggt ttg gca aca aag gtg etc tct ggc tat age aac cag aca 337 
Lys Ser Gly Lei Ala Thr Lys Val Leu Ser Gly Tyr Ser Asn Gin Thr 
10D 1C5 110 



aac aac aca agg gac aga tgg aac cac a3a aga acc aaa cgc ttt ctg 3 = 5 
3 5 Asn Asn Thr Arg Asp Arg Trp Asn His Lys Arg Thr Lys Arg Phe Leu 
115 * " 12Q 125 



tec tac cca egg ttt gta gag gtg atg gtg gtg get gac cac agg atg 435 
Ser Tyr Fro Arg Phe Val Glu Val Met Val Val Ala Asp His Arg Met 
4 0 13 0 13 5 14C 



ctt tta tac cac gga gca aac ctt caa eat tat ate tta acc tta atg 4£l 

Val Leu Tyr His Gly Ala Asn Leu Gin His Tyr He Leu Thr Leu Met 

14 5 15 C 15 5 :ec 

4. z 

tec att gta get tct ate tat aaa gae tea agt att eg =3 aat tta att H3 

Ser lift Val Al a Ser He Tyr Lys Asp Ser Ser He Gly Am Leu He 

:t5 ' h: H5 



lll at: Jt - 3tt gtg Lie tta git gtg -it: ^: ^t g-- cag ga.-. gg.* 
Asn He Val He Val Asn Leu Val val He His Asn Glu Gin 31 j Gly 
15 3 18 5 19 3 



eet tac ata aat ttc aat gec cag aca aca tta aag aac ttt tge cag t'25 
55 Lie Tvr He Asn ?r.e Asr. Ala Sir. Thr Tnr Leu Lys Asn Phe lys Gin 

:?5 :h :c5 
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Thr Leu Gly Leu Ala 31u Leu Gly Thr He Cys Asp Pre Tyr Arq Ser 
24 5 25 C 2 5 5 

tgt ccc a:: age gaa gac sgt ggg ctg ag: aca q:: tec aca aca get £ 1 ~ 
5 Cys Ser He Ser Giu Asp Ser Gly Leu Ser Thr Ala Phe Thr He Ala 
250 265 270 

cac gag ctg ggc cat gig tet aac atg cct cac gat gac age aa: aaa 6£5 
Kis Glu Leu Gly His Val Fhe Asn Met ?ro His Asp Asp Ser Asn Lys 
10 275 230 2S5 

tgc aaa gaa gaa gga gee aag age ccc cag cat gtc atg gca era aca 913 

Cys Lys Glu Glu Gly Val Lys Ser Pre Glr. His Val Ket Ala Fro Thr 
290 295 300 

15 

ctg aac tec tac ac: aac ccc tgg atg tgg tea aag tgc agt egg aaa 551 

Leu Asn Phe Tyr Thr Asn Pro Trp Yet Trp Ser Lys Cys Ser Arg Lys 
3C5 310 315 32 0 

20 tac ate act gag ttc eta gac act ggg tac gga gag tgc teg ctg aat 1009 
Tyr He Thr Glu Phe Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asn 
3 2 5 3 3 0 3 3 5 

gaa cct gca tec agg ace tat cct ttg cct tec caa ctg ccc ggc ctt 105" 7 
2 5 Glu Pre Ala Ser Arg Thr Tyr Fro Leu Pro Ser Gin Leu Pro Gly Leu 

3 4 0 3 4 5 3 5 0 

etc tac aac gtg aat aaa caa tgt gaa ctg att tct ggg cca ggc tct 1105 
Leu Tyr Asn Val Asn Lys Gin Cys Glu Leu Tie Phe Gly Fro Gly Ser 

30 355 360 365 

caa gtg tgc ccc tat atg atg cag tgc aga egg etc tgg tgc aat aat 1153 

Gin Val Cys Fro Tyr Met Met Gin Cys Arg Arg Leu Trp Cys Asn Asr. 

370 375 330 

35 

gtg gat gga gca :ac aaa ggc tgc aag act cag cac acg ccc tgg gca 1201 

Val Asp Gly Ala His Lys Gly Cys Lys Thr Gin His Thr Pro Trp Ala 

335 39C 395 400 

4C gat gga acc gag tgt gag cct gga aag cac tgc aag ttt gga tt: tgt 1245 
Asp Gly Thr Glu Lys Glu Pro Gly Lys His Cys Lys Fhe Gly Fhe Cys 
4C5 410 415 

get. ccc aaa gaa atg gag ggc cct gca ate cat gga tec tgg gga ggt 1297 
45 Val Fre Lys Glu "et Glu Gly Fro Ala He Asp Gly Ser Trp Gly Gly 

4 2 2 12 5 4 11 

1 3? car Hu --c tgc tea .iga acg ege gga gga ggc ace aaa H 4 L ; 

Trp ler His 2 he Gly Thr lys Ser Arg Thr lys Gly Gly Gly He Lys 
r C 4 ] E 44" 44 = 

aca gee ate aga gag tgc aac aga cca gag cca aaa aat ggt ggg aag 1393 

Thr Ala lie Arg Glu Cys Asn Arg Fro Glu Pre Lys Asn Gly Gly Lys 

450 455 4-1 

55 

tac tgt gea gga agg aga atg aag ttc aaa tee tgc aac acg gag ccc 1441 

Tyr lys Val Gly Arg Arg Yet Lys Y'r.e Lys : : e: Cys Asr. Tr.r Clu Pre 

4 4' 4^C ' 
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ttt ccc aag tac age gga att ttg atg aag gac egg tgc aag erg etc 1535 

Phe Pre Lys Tyr Ser Gly He Leu Met Lys Asp Arg Cys Lys Leu Phe 
515 52: 525 

5 

:c: aga gtg cca gga aac aca gee tac tac cag etc cga gac aga gtg 1633 
Cys Arg Val Ala Gly Asr. Thr Ala Tyr Tyr Gin. Leu Arg Asp Arg Val 
53C 535 540 

10 ate gac gga acc cct tgt ggc cag gac aca aat gac ate tgt gtc caa 1681 
He Asp Gly Thr Pro Cys Gly Gin Asp Thr Asn Asp He Cys Val Gin 

545 550 555 560 

ggc ctt tgc egg caa get gga tgc cat cat ate tea aac tea aag etc 1729 

15 Gly Lej Cys Arg Gin Ala Gly Cys Asp His He Leu Asn Ser Lys Val 

56 5 57 C 5^5 

egg aaa gat aaa tgt gag att tgc ggc gga gat aat tct tea tgc aaa :?77 

Arg Lys Asp lvs cys Gly He cys Jly Gly Asp Asn ser Ser Cys Lys 
2C 590 555 5 90 

aca gtg gca gga aca ttt aac act gtc cat tat ggt tac aat act gtc 1325 

Thr Val Ala Gly Thr Phe Asn Thr Val Kis Tyr Gly Tyr Asn Thr Val 
595 60 0 605 

25 

gtc cga att ccg get ggt get acc age ate gac gtg cgt cag cac age 1973 

Val Arg He Pre Ala Gly Ala Thr Ser He Asp Val Arg Gin His Ser 
€13 613 520 

3C tec tea ggg aag tct gag gat gac aac tac eta get tea tea aac age 1321 

Fhe Ser Gly Lys Ser Glu Asp Asp Asn Tyr Leu Ala Leu Ser Asn Ser 
625 63C 635 640 

aaa ggc gaa tec ctg eta aat gga gac ttt gtc gtc tec atg tec aaa 1965 

3E Lys Gly Glu Fhe Leu Leu Asn Gly Asp Phe Val Val Ser Met Ser Lys 

645 650 655 

agg gag gtc cgc gtg egg age gee gtc att cag tac age gga teg gac 2 317 

Arc Glu Val Arg Val Gly Ser Ala Val He Glu Tyr Ser Gly Ser Asp 
4G 66 0 66 5 6 70 

aat gtg tgt gaa aga ctg aac tgt acg gac cgt ate gag gaa gaa ctt 2065 

Asn Val Cys Glu Arg Leu Asn Cys Thr Asp Arg He Glu Glu Glu Leu 
675 650 695 

4 5 

— - cag gtg ctg ::: gtg gga aac ctg tat aac cca gat gtg egg 1111 

Leu Lej Glr. T? al Leu Ser Val Gly Lys Leu Tyr Asn Pre Asp Val Arg 

6 91 6 r 5 " ■: C 

11 - --, c ::i t : z a a: att eer att gag g?. r art*, ctt cag caa t" t. -. r tug 3 
Tyr Ser Phe Asn He Pre He Glu Asp Lys Pre Gin Gin Phe Tyr Trp 
7 j 5 710 "15 "2 0 

aac agt cac ggg ccg tgg caa gca tgc age aag ccc tgc caa ggg gag 22 C 9- 

55 Asn Ser H;g Gly Pre Trr. Sir Alt, Cys Ser Lys Pro Cys Gin Gly Glu 

72 5 " ; 10 7^b 

-;-?. t"" ~- * :i ~: ~- ~.V! ~ ~t - ctt - 2"-~ 
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A: a Cys Gly Thr Asp Cys 

7" : 

gaa tec agt gec cag tgt 

5 3lu Cys Ser Ala Gin Cys 

"95 79: 

tgt gec asa :ac age agg 
Cys Ala Lys Tyr Ser Arg 
10 805 

agt ttc tgt age agt: caa 
Ser Phe Cys Ser Ser Gin 
320 

15 

gga gag cge age aca ggc 
Gly Glu Cys Ser Thr Gly 
625 

20 tec aga age cgt gar ggt 
Ser Arg Ser Cys Asp Gly 
650 

aac ace cgc aat gat gtc 
2 5 Asn Thr Arg Asn Asp Val 
355 8 70 



Asp Leu Arg Trp His Val 
775 76C 

ggt ttg gge tac cgt act 
Gly Leu Gly Tyr Arg Thr 
79E 

a=3 Sac ggg aag acg gag 
Met Asp Gly Ly3 Thr Glu 
610 

ccc aga ecg agt aac cag 
Pro Arg Pre Ser Asn Gin 
625 

gga tag ege tat tea gee 
Gly Trp Arg Tyr Ser Ala 
340 

get ace cae aca aga aga 
Gly Thr His Arg Arg Arg 

555 56 0 

ctg gac gac age aa 
Leu Asp Asp Ser 



Ala Ser Lys Ser 



tta gac ate cae 2401 
Leu Asp lie His 
SCO 

aag gtg gat gac 2449 
Lys Val Asp Asp 
815 

gac aaa tgc tea 2497 
3lu Lys Cys Ser 
830 

tag ace gaa tgt 2 34 5 

Trp Tnr Glu Cys 

345 

gca ate tgt gtc 2503 
Ala He Cys Val 



2625 



<21C> 15 
30 <::il> 674 
<212> FKT 

<213> Mus Tiusculus ADAXTS-9 
<4CC> 15 

35 His Thr Ala Val He Ser Leu Cys Ser Gly Met Met Gly Thr Phe Arg 
15 15 

Ser His Asp Gly Asp Tyr Phe He Glu Fro Leu Git. Ser Val Asp Glu 

20 25 30 

40 

Gin Glu Asp Glu Glu Glu Gin Asn Lys Fro His He He Tyr Arg His 
35 40 45 

Ser Thr Fro Gin Arg Glu Fro Ser "Tr.r Gly Lys His Ala Cys Ala Thr 

4 5 5: 5 5 ez 

Ser 31 u Leu Lys Aon Ser His Ser Lys Asp Lys Arg Lys He Arg >'et 



5 7 -z Lye Arg Lye Arg Asn Ser Leu. AH Aee A ; V H AH : • . Lee 

=5 9: 95 

Lys Ser Gly Leu Ala Thr Lys Val Leu Ser Gly Tyr Ser Asn Gin Tnr 
10 0 1 3 5 HO 

55 

Asn Asn Thr Arg Asr Arc Trp Asn His Lys Arg Thr Lys Arg Phe Leu 
11: 120 12 5 
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Asr. lie Val He Val A sr. Leu Val Val lie His Asn Glu Glr. Glu Gly 
130 15 r 15-0 

5 ?r; Tyr :le Asn ?he Asr. Ala Glr. Thr Tnr Leu Lys Asr. Phe Cys Glr. 
195 20C 205 

Trp Gin His Ser Lys Asr. Tyr Leu Gly Gly He Gin Has Asp Thr Ala 
21C 215 220 

10 

Val Leu Val Thr Arq Glu Asp He Cys Arg Ala Gin Asp Lys Cys Asp 
225 23C 23b 240 

Thr Leu Gly Leu Ala Glu Leu Gly Thr He Cys Asp Pro Tyr Arg Ser 
15 245 250 255 

Cvs Ser He Ser Glu Asp Ser Gly Leu Ser Thr Ala Phe Thr He Ala 
260 265 270 

2 0 His Glu Leu Gly His Val Fhe Asn Met Pro His Asp Asp Ser Asn Lys 

275 280 285 

Cys Lys Glu Glu Gly Val Lys Ser Pro Gin His Val Met Ala Pro Thr 
25C 295 300 

25 

Leu Asr. Phe Tyr Thr Asn Fro Trp Met Trp Ser Lys Cys Ser Arg Lys 
505 31C 315 320 

Tvr He Thr Glu Phe Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asr. 
30 ' 325 33C 325 

Glu Pre Ala Ser Arg Thr Tyr Pre Leu Pro Ser Gin Leu Pro Gly Leu 
340 ~ 345 350 

3 5 Leu Tvr Asn Val Asn Lys Gin Cys Glu Leu He Phe Gly Fro Gly Ser 

355 360 365 

Glr. Val Cys Pro Tyr Xet Xe: Gin Cys Arg Arg Leu Trp Cys Asn Asn 
37C ' 3^5 3B0 

40 

Val Asp Gly Ala His Lys Gly Cys Lys Thr Gin His Thr Pro Trp Ala 
:85 3 50 39 5 4 CO 

Asc Glv Thr Glu Cys Glu Fro Gly Lys Hie Cys Lys Phe Gly Fhe Cys 

.;s .;:e nc -j 15 

Val Prr Lys G 1 u Glu G ", y Pi u AH He A - Gly Ser hp Gly Gly 



.ip Ser His the Glr. Hr Cys Scr Aig . nr Cys G_y G-y -yu 
4 3 5 4 4 C 4-5 5 

Thr Ala He Arg Glu Cys Asr. Arg Pro Glu Pre Lys Asr. Gly Gly Lys 

4 = : 4 5 5 4 6 0 

5 5 

Tvr lys Val Gly Ar~ Arc yet Lys Pne Lys Ser Cys As:: Tnr Glu ?rv 

4 65 4 H- ' 4H 4 = ; 
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Cys Arg Val Ala Gly Asr. Thr Ala Tyr Tyr Gin Leu Arg ^sp Arc Val 
53C = 35 54: 

5 He Asp Gly Thr Fro Cys Gly Gin Asp Thr Asn Asp He Cys Val Gin 
£45 550 555 56C 

Gly Leu Cys Arc Gin Ala Gly Cys Asp His He Leu Asr. 3er Lys Val 
565 570 5^5 

10 

Arg Lys Asp Lys Cys Gly :ie Cys Gly Gly Asp Asr. Ser Ser Cys Lys 
58C * 565 590 

Tr.r Val Ala Gly Thr Phe Asr. Thr Val His Tyr Gly Tyr Asr, Thr Val 
1 5 595 6 CO 605 

Val Arg He Pre Ala Gly Ala Thr Ser lie Asp Val Arg :-ln His Ser 
610 615 620 

2C Fhe Ser Gly Lys Ser Glu Asp Asp Asn Tyr Leu Ala Leu Ser Asn Ser 
625 ' 53C 635 640 

Lys Gly Glu Fhe Leu Leu Asn Gly Asp ?he Val Val Ser Met Ser Lys 
645 650 655 

25 

Arg Glu Val Arg Val Gly Ser Ala Val He Glu Tyr Ser 3-ly Ser Asp 
66C €65 57C 

Asn Val Cys Glu Arg Leu Asn Cys Thr Asp Arg He Glu Hu Glu Leu 
3 0 6 75 6 3 3 66 5 

Leu Leu Gin Val Leu Ser Val Gly Lys Leu Tyr Asn Pre Asp Val Arg 
6SC 595 700 

3 5 Tyr Ser Phe Asr. He Pro He Glu Asp Lys Pro Gin Gin ?he Tyr Trp 
705 71C 715 72C 

Asn Ser His Gly Pre Trp Gin Ala Cys Ser Lys Fro Cys 31n Gly Glu 
725 73C' ^35 

40 

A-rg Arg Pre Lys Leu Val Cys Thr Arg Glu Ser Asp Gin Leu Thr Val 

-740 745 75 0 

Ser Asp Gin A:;- Cvs Asp Am Leu Fr:. Gin Pro Gly Pre VH Thr Glu 

■i5 -IE -6? 765 

Ala Tyr Gly Thr As; Cys Asp L e .. Arg Trp H:s Val a: a Ser Lys Ser 

ii GH Ovi S^: A-j Gin lys Gly Lc Gly Tyi A: g "In: Lu u Asp- I le H;i 
-^3 5 7K 7 55 SCO 

Cys Ala Lys Tyr Ser Arg yet Asp Gly Lys Tr.r Glu Lys Val Asp Asp 
30 5 810 B15 

5E 

Ser Tr.e lys Ser Ser Gin ?r~ Arg Fr: Ser Asn Gin Glu Lys Cys S-r 
E 2 C fi 2 5 3 3 : 
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<2ic> 16 

5 <211> 5 6 65 
c212:» LVJA 

<213^ Hct.o sapiens A3AMTS - 1 3 

<220> 
10 <221> CGS 

<222> (1) . . (32461 

<4C0> 16 

:ca zee acg cct :cc ggt etc aag acg agt tec tec cca gtc egg aga 48 

15 Sei Arg Thr Pro Ser Gly Leu Lys Met Ser Ser Cys ?ro Val Trp Arg 

15 1C 15 

get acg aga teg cct tee :ca ccc gcg tgg ace aca acg ggg car cgc S6 

Ala -*iec Arg Sei f;u Sei ?iu Ala Trp Thr Thr Thr Gly His Cyo 

20 2C 25 30 

tgg :ct tec cgc cac etc etc ccc gga gca gcg ccg egg cac ggg ggc 

Trp Pro Ser Arg His Leu Leu Pre Gl/ Ala Ala Pro Ars His Gly Gly 

35 " 40 45 

25 

cac age cga gtc ccg cct :tt eta caa agt ggc etc gee age ace cac 

Kis Ser Arg Val Fro Pro Leu Leu Glr. Ser Gly Leu. Ala Ser Thr His 

5 0 55 6C 

20 :t: ccg ctg aac ctg ace cgc age tc: cgc cca ctg gca ggg cgc gtc 

?he Leu Leu Aen Leu Thr Arg Ser Ser Arg Leu Leu Ala Gly Arg Val 

€E 70 75 50 

tec gtc gag tac tgg aca egg gag ggc ctg gec tgg cag agg gcg gec 

3 5 Sei Val Glu Tyr Trp Thr Arg Glu Gly Leu Ala Trp Glr. Arg Ala Ala 

85 90 S5 

egg ccc cac tgc etc tac get ggt ca: ctg cag ggc caj gec age age 

Arg Pro His Cys Leu Tyr Ala Gly Kis Leu Gin Gly Glr. Ala Ser Ser 

40 10C 105 110 

tec cat gcg gec ate age acc tgt gga ggc ctg cac ggc ctg ate gtg 

Ser His Val Ala He Ser Thr Cys Gly Gly Leu His Gly Leu He Val 

115 120 125 

4 5 

gca gar gag gaa gag car :tg ate gag ctg ca: ggc ggg ccc a.-.g 

;,:=: Asr. Glu Glu Glu Tyr Leu lie iT.u Pre Leu His Gly Gly Pre Lys 

:3 r . 14: 

:• gg: : c : egg erg g a<g ?3?. ag: gga rca ca: tgt gtg tac a^g eg: 

Gl\ Ser Arg Ser Pro Glu ?lu Ser Gly Pro Hi 5 Cy= Val Tyr Lys Arg 

14= 15: 15 5 ItC 

tec tct ctg cgt cac ccc :ac ctg gac aca gec tgt gga gtg aga gat 525 

^5 £»i Ser Leu Arg His Fro His Leu Asr Thr Ala Cvs GIv Val Arg Asp 
1,5 17C H5 

■ • ; ; : - r •: i \ -j .: j-.: - n r?a tuu c " c ^ a ?c: *. t g si.: cca r ~ ? 



144 



192 



24C 



286 



33* 



364 
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Leu Lys Arg Ser Val Ser Arc Glu Arg Tyr '.'al Glu .hr Met Asp 

210 215 2 2 0 

get gac aag atg atg gtg gec cat cac ggg cgc egg gat gtg gag rag 71 C 

5 Ala Asp Lys Met Met Val Ala Tyr His Gly Arg Arg Asp ''al Glu 31n 

2 2 5 2 3 0 23 5 24 0 

tat gec ctg gec ate atg aac att get gec aaa ctt ttc cag gac teg 758 

Tyr Val Leu Ala He Met Asn He Val Ala Lys Leu Fhe Gin Asp Ser 

10 245 250* 255 

agt etg gga age ace gtt aac ate etc gta act cge etc ate ctg ttc 316 

Ser Leu Gly Ser Thr Val A. sr. He Leu Val Thr Arg Leu :le Leu Leu 

260 26 5 270 

15 

acg gag gae cag ccc act ctg gag ate acc cac cat gec egg aag tec 3£4 

Thr Glu Asp Sir. Pre Thr Leu Glu lie Thr His His ^la Gly Lys Ser 

275 28C 285 

20 eta gac age ttc tgt aag tgg cag aaa tec ate gtg aac cac age ggc 312 

Leu Aep Ser Phe Cys Lys Trp Gin Lys Ser lie Val Asn Pis Ser Gly 

290 255 3CC 

cat ggc aa: gec att cca gag aac ggt gtg get aae eat cac aca gca 9£0 

2 5 His Gly Asn Ala He Pro Glu Asn Gly Val Ala Asn His Asp Thr Ala 

305 31C 515 323 

gtg etc ate aca cgc tat gac ate cgc ate tae aag aac aaa ccc cgc 1008 

Val Leu He Thr Arg Tyr Asp He Cys He Tyr Lys Asn Lys Pre Cys 

2 0 32 5 3 30 22 5 

ggc aca eta ggc ctg gec egg tgg gcg gaa tgt gtg age gcg aga gaa 1056 

Gly Thr Leu Gly Leu Ala Arg Trp Ala Glu Cys Val Ser Ala Arg Glu 

3 4 0 3 4 5 3 5 0 

35 

get gca gcg tea atg agg aca ttg get gec aca age gtt cac cat tec I1C4. 

Ala Ala Ala Ser Met Arg Thr Leu Ala Ala Thr Ser Val His His Cys 

355 3 SO 365 

40 eac gag ate ggg cac aca ttc ggc atg aac eat gac ggc gtg gga aac 1152 

Hie Glu lie 3ly His Thr Fhe Gly Met Asn His Asp Gly Val Gly Asr. 

3 7 0 3 7 5 2 E C 

age tgt ggg gec cut ggt cag gac cca gec aag etc atg get gee cac 12CC 

4 5 Ser Cys Gly Ala Arg Gly 31 n Asp Pro Ala Lys Leu Met /-la Ala His 

3 = 5 3 P : 395 4 : : 

att acc atg aag acc aac era ttc gtg tgg tea tec tgc aac cgt ga~ 1 1 4 H 

He Tnr "vet Lys Thr Asr. Pre Phe Val Trp Ser £ei lys Asn Arg Asp* 

s : 4 : z -, : : 4 : ' 

cac ate aec age ttt eta gac teg gee etc ggg etc tgc ctg aac aac 12336 

Tyr He Thr Ser Phe Leu Asp Ser Gly Leu Gly Leu Cys Leu Asn Asn 

4 2 0 4 2 5 4 3 0 

5 5 

egg ::: ere aga cag gac ttt ctg :ac ccg aca gtg gca ccg ggr caa 1:44 

Arg Pre Fee Arg Glr. Asp Fne Val Tyr Fro Tr.r '.'a 1 A. a Frc Gly Gl:. 

4 3 5 4 4 0 4 4 S 
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egg gga egg ggc gtg aag gec tgc teg etc acg age eta g :g gaa ggc 
3 5 Arg Gly Gly Gly Val Lys Ala Cys Ser Leu Thr Ser Leu A La 31 u Gly 
SIC 615 62 0 



tec cge cca gac acg gtg gac att tgc gtc agt ggc gaa tgc aag cac 

2ys Arg Pro Asp Thr Val Asp He Cys Val Ser Gly Glu Cys Lys His 

£45 ^50 655 

4 5 

gtg gg:: tgc gac ega gtc ctg gec tec ^: ctg eg 2 gag sac* aag tgc 

Val 31 y Cys Asp Arg val Leu Gly Sec Asp Leu Arc Glu \sp Lys Cys 

tt: c>H H: 

H" eg., gr ;; eg- g = c cge ga<- ggc agt gee rg_- gag act arc gag etc 

Arg Val Cys Gly Gly Asp Gly Ser Ala Cys G1j .hr He 31 u Gly va. 

€'-s ttz -: = s 

ttc age cca gee tea cct ggg gee ggg tac gag gat gtc gtc tgg att 

5 5 Phe Ser Fro Ala Ser Fro Gly Ala Gly Tyr Glu Ass Val val Tre He 

€=K t95 "c: 

: r:g~ - c~ «t - - = ~ 3 r - 1 1 C a t e car: grit Ctg cac etc ret etc 



aag age aae egg tgc ate acc aac age ate ecg gee gec gag ggc acg 14S: 

Lys Ser Asr. Arg Cys He Thr Asr ; Ser He Pro Ala Ala Glu Gly Thr 
4 3 5 4SC 

5 

ctg tgc cag acg eac acc ate gac aag ggg tgg tgc tac aaa egg gtc 153= 

Leu Cys Gin Thr His Thr He Asp Lys Gly Trp Cys Tyr Lys Arg Val 

520 505 510 

10 tgt gtc ccc ttt ggg teg cge cca gag ggt gtg gac gga gec tgg ggg 1584 

Cys Val Pre Phe Gly Ser Arg Pro Glu Gly Val Asp Gly Ala Trp Gly 
515 " 52C 525 

ecg tgg act cca tgg gge gac tgc age egg acc tgt ggc 33c ggc gtg 1632 

15 Pre Trp Thr Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly 2ly Gly Val 
S3Z 535 540 

tee tec tct agt cgt cac tgc gac age ecc agg cca acc ate ggg gge 16SG 

3c Ser Sec 3ei Arg His Cys Asp Ser Pre Arg Pre Thr He Gly Gly 
20 545 550 555 550 

aag tac tgt ctg ggt gag aga agg egg eac cge tec tgc aac acg gat 

Lys Tyr Cys Leu Gly Glu Arg Arg Arg His Arg Ser Cys Asn Thr Asp 

5S5 573 575 

25 

gac tgt ccc cct ggc tee eag gac ttc aga gaa gtg eag tg,t get gaa 

Asp Cys Pre Pro Gly Ser Gin Asp Phe Arg Glu Val Gin C/s Ala Glu 

58C 535 530 

30 ttt gac age ate cct ttc cat agg aaa ttc tac aag tgg aaa acg tac 1624 

Phe Asp Ser He Pro Phe Arg Gly Lys Phe Tyr Lys Trp Lys Thr Tyr 
^595 i00 60S 



1725 



1776 



1B72 



ttc aac ttc tac acg gag agg gcg gca gec gtg gtg gac ggg aca ccc 192C 
Phe Asn Phe Tyr Thr Glu Arg Ala Ala Ala Val Val Asp 3 Ly Thr Pro 
40 625 63- 635 64C 



211 2 
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Leu Fro Gly T.ir Pre Gin. Fro His Arg Leu Pro Leu Ala Gly Tn.r -.-.r 

~40 ~45 75j 

tct caa ccg cga cag 35c :;a gac cag gec cag age etc gaa gec ctg 2334 

5 Fhe Gin Leu Arg Gin Gly Fro Asp Gin 7a I Gin Ser Leu Glu Ala Leu 

755 760 765 

gga ccg at: aat gca tec etc ate gtc atg gtg ctg gec egg acc gag 2352 

Gly Pre lie Asn Ala Ser Leu lie Val Met Val Leu Ala Arg Tnr Glu 

10 770 775 780 

ctg ccc gec etc cgc ca; cgc ttc aat gec ccc acc gec cgt gac teg 2430 

Leu Pre Ala Leu Arg Tyr Arg Fhe Asn Ala Pro He Ala Arg Asp Ser 

785 " 79; 79S EDO- 
IS 

ctg ccc ccc tac tec tgj cac tat gcg ccc tag acc aag tgc teg gec 244= 

Leu Pre Pro Tyr 3er Trp His Tyr Ala Pre Trp Thr Lys Cys Ser Ala 

305 s:: S15 

20 cag tgt gca ggc ggt ag: cag gtg cag 905 gtg gag tgc cgc aac cag 2496 

Gin Cys Ala Gly lly Ser Gin Val Gin Ala Val Glu Cys Arc Asn Gin 

820 625 83C 

ctg gac age tec gcg gtc gec ccc cac tac tgc agt gec cac age aag 2544 

25 Leu Asn Ser Ser Ala Val Ala Pro His Tyr Cys Ser Ala His Ser Lys 

83 5 64 C * 345 

ctg ccc aaa agg cag eg: gec tgc aac acg gag cct tgc ect cca gac 2592 

Leu Pre Lys Arg Glr. Arg Ala Cys Asn Thr Glu Pro Cys Pro Pro Asp 

3 0 650 8 55 350 

tgg etc gta ggg aac tgg teg etc tgc age cgc age tgc gat gca ggc 2S4C 

Trp Val Val Gly Asn Trp Ser Leu Cys Ser Arg Ser Cys Asp Ala Gly 

965 37.; 375 83 0 

35 

gtg cgc agt acg teg gt : gtg tgc cag cgc cgc gtc tct gec gcg gag 2558 

Val Arg Ser Thr Ser Val Val Cys Gin Arg Arg Val Ser Ala Ala Glu 

985 BSD 395 

4" 0 gag aag gcg etc gac gac age gca tgc ccg cag ccg cgc cca cct gta 2756 

Glu Lys Ala Leu Asp Asp Ser Ala Cys Fro Gin Pro Arg Pro Pro Val 

90 0 SO 5 91C 



ctg gag gec tgc :ac gg : ccc act tgc ccc ccg gag tgg gca acc etc 

45 Leu Glu Ala Cys His G 1 v ?r: Thr Cys Fr : Pre Glu Trp Ala Thr Leu 

^15 92: c 25 

gac tgg "t gag cgr a:: cca acc tTt 333 cct gg- etc cgc cac cga 

Asp Trp Ser Glu lys Tnr Pre Ser Cys 31 y Pre Gly Leu Arg His Arg 

5 1 = J 9 5 : 9 4 : 

gtg gtc ctt tgt aag agt gca gat caa cga tct act ctg ccc cct ggg 

Val Val Leu Cys Lys Ser Ala Asp Glr. Arg Ser Thr Leu Pro Pro Gly 

94 5 95; 9 55 55 0 

55 

cac tec ctt cct gca ccc aag cca cca tct act atg tgt aac tcq 

His Cys Leu Pro Ala Ala Lys Fro Fro 5er Thr "et Arg Cys Asn Leu 



■54 
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age cac a;: ggc cag c;a tct rga gag tgc act gaa gcc ttg egg ::a 2:72- 
Ser His Thr Gly 31 n Pre 5er Arg Glu Cys Thr Glu Ala Leu Arg Fro 

:o:o ::is ic2c 

5 

tec a:: atg cac cag tgt gag gcc aag tgt gar agt gcc gtg ccg ;cc 3120 
Ser Thr Xet Gin Gin Cys Glu Ala Lys Cys Asp Ser Val Val Fro Pre 
1025 1030 1035 1340 

10 gga gat ggc cca gaa gaa tgc aag gat gtg aac aag gtg get tac tgc 3163 
Gly Asp Gly Pro Glu Glu Cys Lys Asp Val Asn Lys Val Ala Tyr Cys 
1045 105C 1055 

ccc ctg gtg etc aaa ttt cag ttc tgt age cga gcc tac ttc cgc cag 3216 
15 PrD Lea Val Leu Lys Phe Gin Fhe Cys Ser Aug Ala Tyr Fhe Arg Gin 
1C60 1055 1070 

3:3 cgc tgc aaa a:c tgc caa ggc cgc tag ggtacctgga accaacctgg 3 26-; 

'icl C/5 CyS Lya Thr Cys Glw Gly Ai y 
20 1075 1080 

agcacaggct gaggcagggg acatcccact cgagagggca tgagggaaag gggggcttga 332-5 

attgaagggt cagatgeagt tgaaagttat ttattgggta accctacagg gctcccgact ;3Bi 

25 

aagggctgga gaagagcegg ctacccaggg accctctgct gtatcttgee cacttgatag 3446 

tgaagagaga ggactccttg ttgeacacat acttaagtcc ctagcaccce ccccaccctt 3 5 0-5 

30 cgaccg^aat atgtactgtg aagagtgggg gtggggaggg gtgtgctggt ccc:tgccc: 2-. 5 6-5 

:tgcac:gtt ctatccctac accctgagct ggggggaccc atatctgeta -ggggggagc 162-5 

aggect gata ccacctccct gtagccctcc cceagactga cgasggggaa gatccacccc 2 6 5-5 

35 

aacctccgcc ctgcctgccc caggggggag ttcaacatcc aggccgttcc ccatcacggr 3.7 4-5 

:ctacaig:c ctgccctggg ccccacaca: tectcaccaa gaagecttae attaaaaaag 36C-5 

40 ctgtgctatc ctacaaaaaa aaaaaaaaac tegaeggggg gcccggtacc caattcgegr 3S66 

:atag:aaat r.gggtr.tta 36S5 



4 5 -21'J> 1~ 

2 : 1 > 1 : 3 1 

-■2\2> PRC 

-2 13> Y:\~z sar lens A2.AMTS 

c : c 1 2 j > 1 ~ 

Ser Arg Thr Frc Ser Sly Leu Lys Met Ser Ser Cys Pre Val Trr Arc 

15 1C 15 

Ala Met Arg Ser Pre Ser Fro Fro Ala Trp Thr Thr Thr Gly His Cys 
20 25 30 

55 Tro Pro Ser Arg His Leu Leu Pre Gly Ala Ala Fro Arg His Gly Gly 
3 5 4 3 4 5 

His Ser Arg Val Pre Fro Leu Leu Gin Ser Jly Leu Ala Ser T::r Hie 
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Ala 


Asp 


Glu 


Glu 


Glu 




Leu 


He 


Glu 


Fro 


Leu 


His 


Gly 


Gly 


Pre 


Lys 






11C 










135 










14 ; 












Gly 


Ser 


Arg 


Ser 


Pre 


Glu 


Hu 


Ser 


Gly 


Fro 


His 


Cys 


Val 


~y- 


Lys 


Arg 




145 










15 0 










155 










16 C 


5 


Ser 


Ser 


Leu 


Ar 9 


His 


Fr: 


IHs 


Leu 


Asp 


Thr 


Ala 


Cys 


Gly 


Val 


Arg 


Asp 












165 










170 










175 






Gl u 


Lys 


Fro 


Trp 


Lys 


01/ 


Arg 


Pro 


Trp 


Trp 


Leu 


Arg 


Thr 


Leu 


Lys 


Pre 
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185 










190 








Pro 


Pro 


Ala 


Arg 


Pro 


Leu Gly 


Asn 


Glu 


Thr 


Glu 


Arg 


Gly 


Gin 


Pro 


Gly 


10 






195 










2C0 










2C5 










Leu 


Lys 


Arg 


Ser 


Val 


Ser 


Arg 


Glu 


Arc 


Tyr 


Val 


Glu 


Thr 


Met 


Asp 


Val 






210 










215 










220 












Ala 


Asp 


Lys 


Met 


Met 


Val 


Ala 


Tyr 


His 


Gly 


Arg 


Arg 


Asp 


val 


Glu 


Gin 




22 5 










23 C 










235 










24C 


15 


Tyr 


7a 1 


Leu 


Ala 


lie 


Met 


Asn 


He 


Val 


Ala 


Lys 


Le-u 


Phe 


Gin 


Asp 


Ser 












245 










250 










255 






Ser 


Leu 


Gly 


Ser 


Thr 


va 1 


Asn 


He 


Leu 


Val 


Thr 


Arg 


Leu 


He 


Leu 


Leu 










250 










265 










27 0 








Tl.L 


GIl. 


hay 


Gl ; 1 


?:u 


TH 


Leu 


vj 1 


He 


Thr 


H15 


His 


Ala 


Gly 


Lys 


Gar 


20 






275 










2B0 










265 










Leu 


Asp 


Ser 


Phe 


Cys 


Lys 


Trp 


Gin 


Lys 


Ser 


He 


Val 


Asn 


His 


Ser 


Gly 






2 90 










295 










3 00 












K:s 


Gly 


Asn 


Ala 


lie 


Pro 


Glu 


Asn 


Gly 


Val 


Ala 


Asn 


His 


Asp 


Tnr 


Ala 




3 0 5 










310 










315 










32C 


25 


Val 


Leu 


He 


Thr 


Arg 


Tyr 


Asp 


He 


Cys 


He 


Tyr 


Lys 


Asn 


Lys 


Pro 


Cys 












325 










330 










335 






G.y 


Thr 


Leu 


Gly 


Leu 


nl U 


.rvrg 


Trp 


Ala 


Glu 


Cys 


Val 


Ser 


Ala 


Arg 


Glu 










340 










345 










3 5C 








Al_a 


Ala 


Ala 


Ser 


"e: 


Arg 


Thr 


Leu 


Ala 


Ala 


Thr 


Ser 


Val 


His 


His 


Cys 


3C 






355 










36 0 










365 










His 


Glu 


He 


Gly 


His 


Thr 


Phe 


Gly 


Met 


Asn 


His 


Asp 


Gly 


Val 


Gly 


Asn 






37C 










375 










380 












Ser 


Cys 


Gly 


Ala 


Ar 3 


Gly 


Gin 


Asp 


Pro 


Ala 


Lys 


Leu 


Met 


Ala 


Ala 


His 




355 










390 










395 










4 C C 


35 


lie 


Thr 


yet 


Lys 


Thr 


Asn 


Pro 


Phe 


Val 


Trp 


Ser 


Ser 


Cys 


Asn 


Arg 


ASC 












4 0 5 










410 










415 






Tyr 


lie 


Thr 


Ser 


Fhe 


Leu 


Asp 


Ser 


Gly 


Leu 


Gly 


Leu 


Cys 


Leu 


Asn 


Asn 










42 0 










425 










43C 








Arg 


Pro 


Pro 


Arg 


Gin 


Asp 


Phe 


Val 


Tyr 


Pro 


Thr 


val 


Ala 


Pro 


Gly 


Gin 


40 






435 










440 










445 










Ala 


Tyr 


Asp 


Ala 


Asp 


Glu 


Gin 


Cys 


Arg 


Phe 


Gin 


His 


Gly 


Val 


Lys 


Ser 






450 










455 










4 60 












Arg 


Gin 


Cys 


Lys 


Tyr 


Gly 


Glu 


Val 


Cys 


Ser 


Glu 


Leu 


Trp 


Cys 


Leu 


Ser 




4 ■'■ 5 










470 










475 










4 B0 


■i 5 


Lys 


:-e: 


A sr. 


Arc 


"ys 


Tie 




Asn 


Ser 


He 


Pro 


Ala 


Ala 


Hi: 


Gly 


Tnr 






















4 3 0 










4 35 








Cys 


3 1 n 


Th r 


li:s 


Thr 


He 


Asp 


Lys 


Gly 


Trp 


Cy s 


~yr 


Lys 


Arg 


Vli 




Cys 


a 1 


r r 0 


7- he 


-y 


Ser 


3 


:-r; 


Glu 


G -y 


V j . 


A 5 p 


H y 


Al 5 


Trp 


■3 y 








q 1 r; 










5H 










52 3 










F ro 


Trp 




Fro 


Trp 




Asp 


Cys 


Ser 


Arg 


Thr 


Cys 


- - y 


G 1 v 


- - y 


Va 1 






53C 










535 










54 0 












Ser 


Ser 


Ser 


Ser 


Arg 


HIS 


Cys 


Asp 


Ser 


Pro 


Arg 


Pro 


Thr 


He 


Gly 


Gly 




S-,5 










550 










555 










560 


55 


L; '3 


Tyr 


Cys 


Leu 


Gly 


Glu 


Arg 


Arc 


Ar:> 


His 


Arg 


Ser 


Cys 


Asn 


Thr 


Asp 












565 










- 7 " 










5H 






Asp 


Cys 


Ho 


Pro 


3 ^ y 


Ser 


Sin 


Asp 


Ir.c- 


Ar 3 


31 u 


Val 


_■ _ n 


"ys 


Ala 


Glu 



-5: 595 530 



WO nl'l Hl"4 



54 



PC r I SHir 21223 



Val Gly Cys Asp Arg Val Leu 31 y Ser Asp Leu Arg Glu Asa Lvs Cvs 

tec 66 5 " st: 

Arc Val Cys Gly Gly Asp Gly Ser Ala Cys Glu Thr lie Giu Gly Val 
6^5 653 £65 

5 ?he Ser Fro Ala Ser Fro Gly Ala Gly Tyr Glu Aep Val Val Trc lie 
690 635 700 

Pro Lys Gly Ser Val His He Fhe lie Gin Asp Leu Asr. Leu Ser Leu 
705 710 715 72C 

Ser His Leu Ala Leu Lys Gly Asp Gin Glu Ser Leu Leu Leu Glu Gly 
1C 725 730 735 

Leu Pro Gly Thr Pro Gin Pro His Arg Leu Pro Leu Ala C-ly Thr Thr 

740 745 750 

Fhe Gin Leu Arg Glr. Gly Pro Asp Gin Val Gin Ser Leu Glu Ala Leu 
^55 760 765 

15 Gly Pro He Asn Ala Ser Leu He Val Met Val Leu Ala Arg Thr Giu 
77C 775 780 

Leu Pro Ala Leu Arg Tyr Arg Fhe Asn Ala Fro He Ala Arg Asp Ser 
785 79C 795 " * 600 

Leu i- j rc i-ro -yr ter irp r.is iyr Ala ^ro _rp mr Lys Lys zer Ala 
2C 6C5 £10 815 

Gin Cys Ala Gly Gly Ser Gin Val Gin Ala Val Glu Cys Arg Asn Gin 

620 625 83G 

Leu Asp Ser Ser Ala Val Ala Fro His Tyr Cys Ser Ala His Ser Lys 
835 840 845 

2 5 Leu Pro Lys Arg Gin Arg Ala Cys Asn Thr Glu Pre Cys Pro Fro Asp 

85C 85= 860 

Trp Val Val Gly Asn Trp Ser Leu Cys Ser Arg Ser Cys Asp Ala Gly 
865 870 875 Bid 

Val Arg Ser Thr Ser Val Val Cys Gin Arc Arg Val Ser Ala Ala Glu 
30 835 89C 395 

Glu Lys Ala Leu Asp Asp Ser Ala Cys Pro Gin Pre Arg Pro Pro Val 

S0 0 50 5 910 

Leu Glu Ala Cys His Gly Pro Thr Cys Pro Fro Glu Trp Ala Thr Leu 
915 920 925 

3 5 Asp Trp Ser Glu Cys Thr Pre Ser Cys Gly Fro Gly Leu Arg His Arg 

930 935 940 

Val Val Leu Cys Lys Ser Ala Asp Gin Arg Ser Thr Leu Pro Pro G_y 

945 950 955 960 

His Cys Leu Fro Ala Ala Lys Pro Pre Ser Thr Yet Arg Cys Asp. Leu 
40 965 97C " ' 975 

Arg Arg Cys Fro Fro Ala Arg Trp Val Thr Ser Glu Trp Gly Glu Cys 

980 935 99C 

Ser Thr Gin Cys Gly Leu Gly Gin Gin Gin Arg Thr Val Arg Cys Thr 
995 HOC 1CC5 

45 Ser His Thr Gly Gin Fr> Ser Arg Glu Cys Thr Glu Ala Leu Arg Fro 

Ser Thr N^t Glr. Gin Cys Glu A I a Lys Cys Asp Ser Val Val Pro Pro 
ion i<n: ih: :h_. 

Gly Asp Glv Pro Glu Glu Cvs Lvs Asr Val Asn Lvs Val Ala Tvr Cvs 
51 ::4E " * ; :. ; ' ::.h 

Pre Leu Val Leu Lys Fhe Gin Fhe Cys Ser Arg Ala Tyr Fhe Arg Gin 

106 0 1055 H7C 

Yet. Cys Cys Lys Thr Cys Glr. Gly Arc 
1075 105C 
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<40C> 13 

10 a gca gca get: gcg gtg gat gga aca ccc tgc cgc cct gac acg gtg cac 49 

Ala Ala Ala Val Val Asp Gly Thr Pro Cys Arg Pro Asp Thr Val Asp 
1 5 IC 15 

art: tgt etc age cgc gac tgc aag cat gta ggc tgt gac acg etc ctg 9? 
15 He Cys Val Ser Gly Glu Cys Lys His Val Gly Cys Asp Arg Leu Leu 
2 0 2 5 3 0 

ggt tct gat etc cga gag gac aaa tgc cgt gtg tgt ggg ggt gat ggc 145 
Gly Ser Asp Leu Arg (Jiu Asp Lys Cys Arg Vai cys tiiy G-y Asp : Jly 
2 0 35 4C 45 

agt gec tgt gag acc att gaa ggt gtc ttt age cca get ttg cca gga 193 

Ser Ala Cys Glu Thr He Glu Gly Val Fhe Ser Pre Ala Leu Pro Gly 

50 55 €C 

25 

act ggg tat gag gac etc gtc tag ate ccc aaa ggc teg gtc cac att 241 

Thr Gly Tyr Glu Asp Val Val Trp He Fro Lys Gly Ser Val His He 

65 70 75 80 

30 ttc ate caa gat ctg aae ctg tec ctg agt cac ctg gee eta aag ggg 2=9 
Fhe He Gin Asp Leu Asn Leu Ser Leu £er Kis Leu Ala Leu L-/e Gly 
85 90 95 

gac eaa gag tct ctg eta ctg gag ggg eta cct gcg acc eec caa cct 337 
35 Asp Gin Glu Ser Leu Leu Leu Glu Gly Leu Fro Gly Thr Fro Gin Pro 
10C 105 113 

nac cgc ett ccc etc gnt ggg acc aca ttt cat eta egg eag ggg ccg 35 5 
Xaa Arg Leu Pro Leu Xaa Gly Thr Thr Fhe His Leu Arg Gin Gly Pro 
40 115 120 125 

gac eag gca cag age ctg gaa gee ctg gga ccc att aat gea tct etc 433 

Asp Gin Ala Gin Ser Leu Glu Ala Leu Gly Pre lie Asn Ala Ser Leu 

13 5 13 5 1 4 C 

45 

-to ate ate gtg ctg ccc cag cca gag ttg cct get etc cac tae ccc 4 8 1 

Ile lie yet Val Leu Ala Gin Ala Glu Leu Fro Ala Leu His Tyr Arc 

l-'i - 15 0 15 5 H 1 

z- 1 ttt aat ere att get egg gat g-" r: g cct tar t:: - t g g : 

?he Asr. Ala Fro He Ala Arg Asp Ala leu Pro Pre Tyr Ser Trp His 
165 1VC 175 

tat gec ccc tgg acc aaa tgc tea gec cag tgt gca ggc ggc ago cac 57~ 
5 5 Tyr Ala Fro Trp Thr Lys Cys Ser Ala Gin Cys Ala Gly Gly Ser Gin 

IS? 155 1 9 j 
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Cys Asn Thr Glu Pro Cys Frc Pro Asp Trp Val Val Gly Asr. :io Ser 

225 23: 235 241 

cgc :gc age cgt age tgt gac get ggc gtg cgc age cgc tea gtg gtg 

5 Arg Cys Ser Arg Ser Cys Asp Ala Gly Val Arg Ser Arg Ser Val Val 

24= 25C 255 



n c5 



tgc caa cgc egg gtg tct get gea gag gaa aaa gec tta gac gac agt e:7 

Cys Glr. Arg Arg Val Ser Ala Ala Glu Glu Lys Ala Leu Asp Asp Ser 

10 260 265 270 

gec tgt cca cag cca cgc cca cct gtg ctg gag gee tgc caa ggc cca 365 

Ala Cys Pro Gin Pro Arg Pro Pre Val Leu Glu Ala Cys Gin Gly Pro 

275 260 265 

15 

atg tgc cct eet gag tgg gea ace etc gar egg tee cag tgt ace cca 513 

Met Cys Fro Pre Glu Trp Ala Thr Le ^ Asp Trp Ser Glu Cys Thr Pro 
29Z 295 3C0 

20 age tgt ggg eet ggt etc cgc cae cga gtg gtc etc tgt aag agt gea Stl 

Ser Cys Gly Pre Cly Leu Arg His Arg Val Val Leu Cys Lys Ser Ala 
3 05 * 31C 315 3 2 0 

gat caa cga tct art ctg cec cct ggg eac tgc etc cct gea gec aag ICC 9 

2 5 Asp 3ln Arg Ser Thr Leu Pro Pro Gly His Cys Leu Pro Ala Ala Lys 

325 333 335 

cca eca tct act atg cga tgt aac ttg cgc cgc tgc cct cct gec cgc 1C57 

Pro Pro Ser Thr Ket Arg Cys Asn Leu Arg Arg Cys Pro Pro Ala Arg 

3 0 340 345 35 0 

tgg gtg acc agt gag tgg ggc gag tgt tec aea cag tgt ggc etc ggc 11 C 5 

Trp Val Thr Ser Glu Trp Gly Glu Cys Ser Thr Gin Cys Gly Leu Gly 

355 360 365 

35 

cag cag cag ege aca gtg cgc tgc acc age cac aee ggc cag cca tct 1153 

Gin Gin Gin Arg Thr Val Arg Cys Thr Ser His Thr Gly Gin Fro Ser 
3 7 0 3 7 5 3 3 0 

40 cga gag tgc act gaa gec ttg egg cca tec acc atg cag cag tgt gag 12C1 

Arg Glu Cys Thr Glu Ala Leu Arg Fro Ser Thr Met Gin Gin Cys Glu 
365 33C 395 4CC 

gee aac tgt cae agt gtg gtg ccg cct gea gat ggc cca gaa gaa tgc 124; 

4 5 Ala -y= Cys Asp Ser Val Val Pre Frc Civ Asp Gly Frc Glu Clu Cys 

4 0 5 4 11 4 15 

aag gat gtg aac aag gtg get tar tgc ctg gtg etc aaa ttt cag 

Lys Asp Val Asr. Lys Val Ala Tyr Cys Frc Leu Val Leu Lys Frte- Glr. 

i : 4 2? 425 4 : ; 

tte tgt age ega gee tac tte cgc cag atg age tge aaa acc tgc caa 13)5 

Fhe Cys Ser Arg Ala Tyr Fne Arg Gin >'ez Ser Cys Lys Thr Cys Glr, 

4 3 5 4 4 C- 4 4 5 

c z 

ggc cgc tagggtarer ggaarr^acc tggagnrag g-~gaggcag gggararrrc 14,1 
Civ Ar= 
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5 <:ic> 19 
<2i:> 45c 

<212> PRT 

<213> MuS rr.usculus ADAMTS-H 
10 <40C> IS 

Ala Ala Ala Val Val Asp 31y Tr.r Pro Cys Arg Pro Asp Thr Val Asp 
15 10 15 

lie Cys Val Ser Gly Glu Cys lys His Val Gly Cys Asp Arg Leu Leu 
15 20 25 30 

Gly Ser Asp Leu Arg Glu Asp Lys Cys Arg Val Cys Gly Gly Asp Gly 
35 40 45 

2 0 Ser Ala Cys Glu Thr He Glu Gly Val Fhe Ser Pre Ala Leu Pre Gly 

5 0 5 5 6 0 

Thr Gly Tyr Glu Asp Val Val Trp He Fro Lys Gly Ser Val His He 
£5 * 7C * 75 SC 

25 

Phe He Gin Asp Leu Asr. Leu Ser Leu Ser His Leu Ala Leu Lys Gly 

65 90 

Asp Gin Glu Ser Leu Leu Leu Glu Gly Leu Pre Gly Thr Fro Gin Pro 
30 * ICO 105 110 

Xaa Arg Leu Pro Leu Xaa Gly Thr Thr Phe His Leu Arg Gin Gly Pro 
115 120 125 

3 5 Asp Gin Ala Gin Ser Leu Glu Ala Leu Gly Pre He Asr. Ala Ser Leu 

'l20 135 14 0. 

He He Me: Val Leu Ala Gin Ala Glu Leu Pro Ala Leu His Tyr Arg 
145 ISC 155 16C 

40 

Phe Asn Ala Pro He Ala Arg Asr Ala Leu Pre Pro Tyr Ser Trp His 
16S 170 H5 

Tyr Ala Frc Trp Thr Lys Cys Ser Ala Gin Cys Ala Gly Gly Ser Gin 

4 5 UC 16 5 :=o 

VH Zlr. Val Val Glu Cys Arg Asn Gin Asp Ser Ser AH Va AH 



5 : H- c- l-"i -*v£ i'.-r Glj H 1 5 Lys L-:-- 1- r Lys Arg J.n Arg A . ;= 

2i: 215 220 

Cys Asn Thr Glu Pre Cys Fro Fro Asp Trp Val Val Gly Asn Trp Ser 

225 23 C 235 24 C 

5 5 

Aru Cys Ser Ar= Ser Cys Asp Ai a Gly Val Aig Ser Arg Ser Val Val 
245 2H 255 



WO ill ■■1111-4 



5S 



P( I I Slid 21223 



Ser Cys Gly Pro Gly Leu Arg His Arg Val Val Leu Cys Lys Ser Ala 
3C5 3 1 j 315 32 1 

5 Asp Gin Arg Ser Thr Leu Pre Pre Gly His Cys Leu Fro Ala Ala Lys 
325 33C 335 

Pro Pre Ser Thr Met Arg Cys Asn Leu Arg Arg Cys Pro Pre Ala Arg 
340 ' 345 " 350 



10 

Trp Val Thr Ser Glu Trp Gly Glu Cys Ser Thr Gin Cys Gly Leu Gly 

355 360 365 

Gin Gin Gin Arg Thr Val Arg Cys Thr Ser K13 Thr Gly Gin Pre Ser 

15 37C 375 360 

Arg Glu Cys Thr Glu Ala Leu Arg Pre Ser Thr Ket Gin Gin Cys Glu 
385 330 335 4 C C 

2 0 Ala Lys Cys Asp Ser Val Val Pro Pre Gly Asp Gly Pro Glu Glu Cys 
405 410 415 

Lys Asp Val Asn Lys Val Ala Tyr Cys Pro Leu Val Leu Lys Phe Gin 
420 425 430 



25 



Phe Cys Ser Arg Ala Tyr Fhe Arg Gin Met Ser Cys Lys Thr Cys Gin 
435 440 445 



G-ly Arg 
3C 450 



<210> 20 

35 <211> 1603 

•:213> Korr.c sapiens ADAMTS-R1 

<220> 

4C <221s CDS 

<222> : 5 1 ) . . ( 1634 ) 

■:4 0C> 2 0 

gaattcrgra cgaggcagtg tergal -erg artceggeaa ggareeaage atg gaa 55 
4 5 Ket Glu 



tge eg-, e^g gea 3:: err ggr. a:a erg ' • :t: ttt rig g — err 1 :: 4 

Cys O/i Arg Arg Ala Thr F re C-_y Thr Le^ Leu Le j Phe Le*u Ala Phe 

- " 5 : 1 15 

ctg err erg agt rce agg are gca cgr tec gag gag gae egg gar. gge 152 

Leu Leu Leu Ser Ser Arrg Thr Ala Arg Ser Glu Glu Asp Arg Asp Gly 

2C 25 30 

5 5 

rra t gg gat gee egg ggc eea t gg agt gaa tgc tea rgr are tgr -gg 2 0 0 

Leu Trp Asp Ala Trp Gly Fie Trp Ssr G^e Cys Ser Arg T:;r Cys Gly 
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;ca gaa gca ggt qa: etc cga get cag caa 15= tea get cat aat gat 1-4 -J 

Pro Glu Ala Glv Asp Fhe Arq Ala Glr. Glr. Cys Ser Ala His Asr. Asp 

35 5B 

5 

gtc sag cac cat ggc cag ::t ta: gaa tgg ett cct gtg tct aat gac :*?2 

Val Lys Hi 3 His Gly Gin Phe Tyr Glu Trp Leu Fro Val Ser Asr. Asp 

100 " 1C5 li: 

10 cct gac aac cca tgt tea etc aag tgc caa gec aaa gga aca acc etc 44C 

Pro Asp Asn Pro Cys Ser Leu Lys Cys Gin Ala Lys Gly Tnr Thr Leu 

115 * 120 125 13C 

gtt gtt gaa eta gca cct aag gtc tta gat ggc acg cgt tgc tat aca 
15 Val Val Glu Leu Ala Fro lys Val Leu Asp Gly Thr Arg Cys Tyr Chi- 
lis 14 0 14 5 

gaa tct ttg gat atg tgc ate agt ggt tta tgc caa att gtt ggc tgc 516 

Glu ser Leu Asp Met Lys He Ser Hy Leu Cys Hn He Val Gly Cya 

20 150 155 160 

gat cac cag ctg gga age acc gtc aag gaa cat sac tgt egg gtc tgc S'c-i 

Asp His Gin Leu Gly Ser Thr Val Lys Glu Asp Asn Cys Gly Val Cys 

165 17C 175 

25 

aac gga gat ggg tec acc tgc egg erg gtc cga ggg cag tat aaa tec 6:2 

Asn Gly Asp Gly Ser Thr Cys Arg Leu Val Arg Gly Gin Tyr Lys Ser 

130 165 190 

30 cag etc tec gca acc aaa teg gac gac acc gtg gtt gca att cce tat C50 

Gin Leu Ser Ala Thr Lys Ser Asp Asp Thr Val Val Ala He ?rc Tyr 

1S5 20C 20b 210 

gga agt aga cat att cgc etc gtc tta aaa ggt cct gat cac tta tat 725 

3 5 Gly Ser Arg His He Arc Leu Val Leu Lys Gly Fro Asp His Leu Tyr 

215 22 0 225 

etc gaa acc aaa acc etc cag ggg act aaa ggt gaa aac agt etc age 7~'£ 

Leu Glu Thr Lys Thr Leu Gin Gly Thr Lys Gly Slu Asn Ser Leu Ser 

40 230 235 240 

tec aca gga act ttc etc gtg gac aat tct agt gtg gac ttc cag aaa =1:4 

Ser Thr Gly Thr Phe Leu Val Asp Asn Ser Ser Val Asp ?he Gin Lys 

24r 25C 255 

* CI 

■ ; 

etc :j c gac aaa gag at a ctg aga «tg get 93a zzu etc ac-a gca gee 

Fhe Fro Asp Lys Glu lie Leu Arg Met Ala Gly Frc Leu Thr Ala Asp 

2iz 26?. 

c J ttc ett gtc aag att cgt aac teg ggc tec get gar cgt gtc rag "-. ■ 

Phe He Val Lys He Arg Asn Ser Gly Ser Ala Asp Ser Tnr Val Gin 

175 ' 2BZ 25: 

ttc ate ttc tat caa cec ate ate cac cga tgc agg gag aeg gat etc l"i S 

55 Fhe He Phe Tvr Gin Fro He He His Arg Tn: «rg Glu Thr Asp Fhe 

29: itC 305 
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Hie Tyr Tyr Pre Glu Asr. He Lys Fro Lys ?r: Lys leu Gin Glu Cys 

j-;: ' 3-4 5 35 C 

iac ttg gat cc: tgt c;a cc; agt gac gga :a: aag cag arc atg cct 

5 Asn leu Asp ?-° -V s Frc Ala 3er As P -'-I' 7 Y r L >' 5 01 r - :le Mec prc 

355 360 365 3"C 

tat gac etc tac cat ccc ctt cct egg tgg gag gec acc cca tgg acc 

Tyr Asp Leu Tyr His Pro leu Prc Arg Trp Glu Ala Thr Pro Trp Thr 

10 ' ' 375 330 385 
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gcg tgc tec tec tec tgt ggg ggg ggc ate tag age egg gca gtt tec 

Ala Cys Ser Ser 3er Cys Gly Gly Gly He Gin Ser Arg Ala Val Ser 

3SC 395 400 

15 

tgt gcg gag gag gac ate cag ggg cat gtc act tea gtg gaa gag tgg 13C*, 

Cys Val Glu 3Iu Asp lie Gin Gly His val Thr Ser Val Glu Glu Trp 

405 410 415 

20 aaa tgc atg tac acc cct aag atg ccc ate gcg cag ccc tgc aac act 1351: 

lys Cys Met Tyr Thr Prc Lys Met Pro He Ala Gin PrD Cys Asn He 

42C 425 430 

ttt gac tgc cct aaa tgg ctg gca cag gag tgg tct ccg tgc aca gtc 14C0 

2 5 ?he Asp Cys Pro Lys Trp Leu Ala Gin Glu Trp Ser Fid Cys Thr Val 

435 440 445 450 

acg tgt ggc rag ggc etc aga tac cat gtg gtc etc tgc ate gac cat 

Thr Cys Gly 31r. Gly Leu Arg Tyr Arg Val Val Leu Cys lie Asp Mis 
30 455 460 465 

cga gga atg :ac aca gga ggc tgt age cca aaa aca aag zee cac ata 

Arg Gly Met His Thr Gly Gly Cys Ser Fro Lys Thr Lys Fro His He 

470 475 4S0 

35 

aaa gag gaa tgc ate gca ccc act ccc tgc tat aaa ccc aaa gag aaa 

lys Glu Glu Cys He Val Fro Thr Prc Cys Tyr Lys Pre Lys Glu Lys 

485 450 495 

40 ctt cca gtc = ag gec aag ttg cca tgg ttc aaa caa get :aa gag eta 

leu Pro Val Glu Ala Lys Leu Pre Trp Phe lys Gin Ala Gin Glu leu 

E'CC 505 510 

gaa gaa gga get get gtg tea gag gag ccc teg taa gtt gta 

45 Glu Glu Gly Ala Ala Val Ser Glu Glu Fro Ser Val Val 

515 5 2: 523 
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Ser Gin 
155 
Tyr Gly 
21C 

Tyr Leu 

Ser Ser 
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Asp Fhe 
275 
Glr. Fhe 
230 

?he Phe 

Ala Glu 

Cys Hi a 

Cys Asr. 

3 55 
Fro Tyr 
370 
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Met 

Trp 

3 

VH Ser Cys 

Glu Trp Lys 

Asr. He Fhe 
435 

Thr V.*l Thr 
450 

•Hs Arg 



Asp 
465 
5 5 Hie 
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Gly Ala 

Gly Arg 

Glu Ala 
65 

Lys His 

100 

Asp Asr. 

Val Glu 

Ser Leu 

His Gin 
165 
Gly Asp 
1B0 

Leu Ser 

Ser Arg 

Glu Thr 

Thr Gly 
245 
Fro Asp 
260 

He Val 

He Phe 

Pro Cys 

Cys Tyr 
325 
Tyr Tyr 
34 0 

Leu Asp 
Asp Leu 
Cys Ser 

Val Glu 

Cys Me: 

4 : ■: 

Asr Ove 

Cys Gly 

Gly Met 

Glu Glu 
4 

Pre val 



Ala Asr. 
5 5 

Asr He 

7C 

Gly Asp 

His Gly 

Pre Cys 

Leu Ala 
13 5 
Asp Met 
15C 

Leu Gly 

Gly Ser 

Ala Car 

His He 
215 
Lys Thr 
230 

Thr Phe 

Lys Glu 

Lys He 

Tyr Gin 
295 
Ser Ala 

310 

Asp Leu 

Pro Glu 

Fro Cys 

Tyr His 
375 
Ser Ser 

3?: 

Glu Asp 

Tyr Tr.r 

Fro Lys 

Gin Gly 
4 55 
K:s Thr 
4 "0 

Cys He 
Glu Ala 



Ser Leu Arg 
Arg Tyr Arg 

Fhe Arg Ala 

90 

Gin Phe Tyr 
105 

Ser Leu Lys 
12 0 

Fro Lys Val 
Cys He Ser 
Ser Thr Val 

170 

Thr Cys Arg 
185 

Lys Ser Asp 

200 

Arg Leu Val 

Leu Gin Gly 

Leu Val Asp 
250 

He Leu Arg 
265 

Arg Asn Ser 
ISO 

Pre He He 
Thr Cys Gly 

Arg Ser Asn 

330 

Asn He Lys 
34E 

Pre Ala Ser 
160 

Pro Leu Fro 
Cys Gly Gly 
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^5 
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235 
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K_s 
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315 
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Arg 
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3 95 
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4,0 

Leu Arg Tyr 
Gly Gly Cys 
Val Fro Thr 

4 0 

Lvs Leu Tro 
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4^5 



3ys 
60 
:ys 

Gin 

Trp 
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140 
Leu 

Glu 

Val 

Thr 

Lys 
22 0 
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Ser 

Ala 

Ser 

Arg 
3CC 
Gly 

Val 

Lys 

Gly 

Trp 
3SC 
He 

'.'a 1 

Ilc- 
Cl 
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it : 

Pre 
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Leu Ser 

Ser Asn 

Cys Ser 

Leu Pro 
11c 

Ala Lys 
125 

Gly Thr 

Cys Gin 

Asp Asn 

Arg Gly 
190 
Val Val 
205 

Gly Pro 

Gly Glu 

Ser Val 

Gly Pro 
2 70 
Ala Asp 
285 

Trp Arg 

Tyr Gin 

Val Ala 

Pro Lys 
350 
Tyr Lys 
365 

Giu Ala 
Gin Ser 

Thr Ser 
Ala Glr. 

4 :■ ; 
Trp Ser 
4 4 5 

Val Leu 
Lys Thr 
Tyr Lys 
Lvs Gin 



Ser Lys 

Vdl Asp 
60 

Ala His 
95 

Val Ser 

Gly Thr 

Arg Cys 

He Val 
160 
Cys Gly 
175 

Glr. Tyr 

Ala He 

Asp His 

Asn Ser 
240 
Asp Phe 
255 

Leu Thr 

Ser Thr 

Glu Thr 

Leu Thr 
320 
Asp Gin 
335 

Leu Gin 
Gin He 
Thr Pro 
Arg Ala 

Val Glu 

4 : 5 

Prt "y 

Prt Cys 

Cys lie 

Lys Pre 
4 ~ : 
Fro Lys 
4 H 

Ala Glr. 
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<i2 13> H;-.c sapiens ALLAY 7 S - 5 
< 4 C C > 2 2 

Gly His Leu leu Gly Leu Ser His Asp Asp Ser Lys Phe Cys Glu Glu 
5 1 5 1C 15 

Thr ?he Gly Ser Thr Glu Asp Lys Arc Leu Met Ser Ser lie Leu Thr 

2 0 2 5 3 0 

10 Ser He Asp Ala Ser Lys Pro Trp Ser Lys Cys Thr Ser Ala Thr lie 
35 40 45 

Thr Glu Fhe Leu Asp Asp Gly His Gly Asa Cys Leu Leu Asp Leu Pro 
5 0 5 5 6 G 

15 

Arg Lys Gin lie Leu Gly Pro Glu Glu Leu Pro Gly Gin Thr Tyr Asp 
65 70 ~5 SO 

Ala Thr Gin Gin Cys Asn Leu Thr Phe Gly Fro Asp Tyr Ser Val Cys 
20 85 90 S5 

Pre Gly Xaa Asp Val Cys Ala Arg Leu Trp Cys Ala Val Val Arg Gin 

100 105 110 

2 5 Gly Gin Met Val Cys Leu Thr Lys Lys Leu Fro Ala Val Glu Gly Thr 
115 * 12C 125 



Pre Cys Gly Lys Gly Arg lie Cys Leu Gin Gly Lys Cys Val Asp Lys 



135 



30 



Thr Lys Lys Lys Tyr Tyr Ser Thr Ser Ser His Gly Asn Trp Gly Ser 



145 



150 



Trp Gly Ser Trp Gly Gin Cys Ser Arg Ser Cys Gly Gly Gly Val Gin 
35 165 170 175 

Phe Ala Tyr Arg His Cys Asr. Asn Pre Ala Pro Arg Asn Asn Gly Arg 
180 185 190 

4 0 Tyr Cys Thr Gly Lys Arc Ala He Tyr His Ser Cys Ser Leu Met Pro 

195 2 CO 205 

Cys Pro Pro Asn Gly Lys Ser Phe Arg His Glu Gin Cys Glu Ala Lys 

21C ' 215 220 

Asn Gly Tyr Gin Ser Asp Ala Lys Gly Val Lys Thr Phe Val Glu Trp 



v . a^ 



\sv Val Cvs Lvs Leu Thr 



i a Ly 3 G _ y . n r 
2 5 0 



Tyr Val Val Fhe Ser Pro Ly; 
265 27C 
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Val Arg lie Fro Glu Gly Ala Thr His lie lys Val Arc Gin ?he Lys 
34 C 34 5 3 5C 

Ala :vs Asp Gin Thr Arg Fhe Thr Ala Tyr Leu Ala Leu Lys Lys Lys 
5 ' 355 36C 365 

Asn Gly Glu Tyr Leu He Asn Gly Lys Tyr y.et He Ser Thr Ser Glu 
370 3~5 330 

10 Thr He He Asp lie Asn Gly Thr Val Mec Asn Tyr Ser Gly Trp Ser 
38S 390 395 4C0 

His Arg Asp Asp Fhe Leu Kis Gly Met Gly Tyr Ser Ala Thr Lys Glu 
405 41C 415 

15 

:le Leu He Val Gin He Leu Al3 Thr Asp Pro Thr Lys Pre Leu Agp 
4 2 0 4 2 5 4 3 0 

val Arg Tyr Ser Fhe Fhe Val Pio Lyt Lys» Sei T:.i Flo Lys Val Asn 
20 435 440 445 

Ser Val Thr Ser His Gly Ser Asn Lys Val Gly Ser His Thr Ser Gin 
450 455 460 

2 5 Fro Gin Trp Val Thr Gly Frc Trp Leu Ala Cys Ser Arg Thr Cys Asp 
465 470 475 48C 

Thr Gly Trp His Thr Arc Thr Val Gin Cys Gin Asp Gly Asn Arg Lys 
485 490 455 

30 

Leu Ala Lys Gly Cys Fro Leu Ser Gin Arg Fro Ser Ala Fhe Lys Gin 
5CC * 505 510 

Cys Leu Leu Lys Lys Cys 
35 J 515 



c210> 23 

<211> 3 409 

4 0 .-.?.12> DMA 

■:213> Kcrr.o sapiens ATAyT3-]0 

<220> 

■ :221> CCS 

45 :.222 > ;251 . . ■ 324:: 



35 



■.4:j> 2 3 

tea r^r ■:: t r regt rt cnag atg agl t re t^t gtc t jg ijj g:t 

Ser Ser Zys Frt '.'a I Irp Arg A .3 



arg aga erg cct :c: oca ::c gcg tgg ace aca a eg ggg eac tgc tgg 
■'.e: Arg Ser Frc Ser Fro Fro Ala Trp Thr Thr Thr Gly His Cys Trp 
10 15 22 25 

ect tet cgc car etc etc rcr :;ga gea gcg ccg rgg car ggg 33- : 1 " 

Frc Ser Arg His Leu Leu Fro jly A 1 3 Ala Frc Arg Hie Gly Gly Kir 

3 I " = 4 " 
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gtg gag tac tgg ac= egg gag cge ccg gee teg cag agg gcg gee egg 25: 

Val Glu Tyr Trp Thr Arc Glu Gly Leu Ala Trp Glr. Arg Ala Ala Arc 

7 5 3 C E 5 

5 

ccc :ac tgc etc :ac get ggt cac ctg cag ggc tag gec age age :c; 5 5 & 

?rc His Cys Leu Tyr Ala Gly His Leu Glr. Gly Gin Ala Ser Ser Ser 

9C 95 100 105 

10 cac gtg gec ate age acc tgt gga ggc ctg cac ggc ccg arc gtg gca 357 

His Val Ala He Ser Thr Cys Gly Gly Leu His Gly Leu He Val Ala 

110 * 115 120 

gac gag gaa gag tac ctg att gag :cc ctg cac ggt ggg ccc aag ggt 435 

15 Asp Glu Glu Glu Tyr Leu lie Glu Pro Leu His 31y Gly Pro Lys Gly 

125 13 0 135 

tct ccg age ccg gag gaa age gga cca cat gtg gcg tac aag cgt tec 4=3 

Ser Arg Ser Pre Glu Glu Ser Gly Fro His Val Val Tyr Lys Arg Ser 

20 140 145 150 

tct ctg cgt cac ccc cac ctg gac aca gec tgt gga gtg aga gac gag 53 1 

Ser Leu Arg His Pre His Leu Asp Thr Ala Cys Gly Val Arg Asp Glu 

155 160 155 

25 

aaa ccg tgg aaa ggg egg cca tgg egg ctg egg ace ttg aag cca ccg 579 

Lys Pro Trp Lys Gly Arg Fro Trp Trp Leu Arg Thr Leu Lys Pro Pro 

170 * 17l 190 165 

30 cct gec aga ccc ctg ggg aat gaa aca gag cgt ggc cag cca ggc ccg €27 

Pro Ala Arg Pre Leu Gly Asr. Glu Thr Glu Arg Gly Gin Pro Gly Leu 

19C 195 200 

aag eg a teg gtc age cga gag cge tac gtg gag acc ctg gtg gtg get 675 

3 5 Lys Arg Ser Val Ser Arg Glu Arg Tyr Val Glu Thr Leu Val Val Ala 

205 " 210 215 

gac aag atg atg gtg gec tat cac ggg cge egg gat gtg gag cag tat 723 

Asp Lys Met Yet Val Ala Tyr His Gly Arg Arg Asp Val Glu Gin Tyr 

40 220 225 230 

gtc ctg gec ate atg aac att get gee aaa ctt tec cag gac teg act 771 

Val Leu Ala lie Met Asr. He Val Ala Lys Leu Phe Gin Asp Ser Ser 

23 5 240 24 5 

j_ - 

ctg gga age acc git aac ate etc gta act cge ct: ate ctg etc a eg r 1 

Leu Gly Ser Thr Val As:: He Leu Val Thr Aig Leu He Leu Lc ... Thr 

LEG 2 5 5 25 1 H 5 

Jig J a-- -cc act ctg gjg i:. -a- g~ L-jJ ^- : J - — - - ' 

Glu Asp Glr. Pre Thr Leu Glu He Thr His His Ala Gly Lys Ser Leu 

2 7C 2H 25: 

gac age ttc tgt aag tgg cag aaa tec ate gtg aac cac age ggc eat 515 

c 5 Asp Ser Fhe Tvs Lys Trp Gin Lvs Ser He Val Asr. His Ser Gly H: = 

2E5 253 2?S 
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Thr leu Gly Leu Ala Arg Trp Ala Glu Cys Val Ser Ala Arc Glu Ala 
3 3 C 3 3 5 ' 340 " 345 

gca gec tea aeg agg aca ctg get gee aca age get eae cat tgc cac Lie 7 
5 Ala Ala Ser Met Arg Thr Leu Ala Ala Thr Ser Val His His Cys His 
353 355 36C 

gag ate ggg cac aca ttc ggc atg aac cat gae ggc gtg gga aae age 1155 
Glu lie Gly His Thr Phe Gly Met Asn His Asp Gly Val Gly Asn Ser 
10 365 37C ' 375 

tgt ggg gee cat ggt eag gae eca gee aag ete atg get gee cac att 1205 

Cys Gly Ala Arg Gly 31n Asp Pro Ala Lys Leu Met Ala Ala His He 

363 3 85 3 9 0 

15 

ace atg aag acc aac eca ttc gtg tgg tea tec tgc aac cgt gae cac 1251 

Thr Met Lys Thr Asn Pre Phe Val Trp Ser Ser Cys Asr. Arg Asp Tyr 

395 4 c C 405 

20 ate ace age ttt eta gae teg gge ctg ggg etc tgc ctg aac aae egg 1259 
lie Thr Ser Phe Leu Asp Ser Gly Leu Gly Leu Cys Leu Asn Asn Arg 
410 415 420 425 

cee ccc aga cag gae ttt gtg tae ccg aca gtg gca ccg ggc caa gee ' 1347 

2 5 Pre Pro Arg Glr. Asp Phe Val Tyr Pro Thr Val Ala Fro Gly Gin Ala 

430 435 44C 

tae gat gca gat gag caa tgc cgc ttt eag cat gga gtc aaa teg cgt 13S5 
Tyr Asp Ala Asp Glu Gin Cys Arg Phe Gin His Gly Val Lys Ser Arg 

3 0 44 5 45 C 4 55 

cag tgt aaa tae ggg gag gtc tgc age gag ctg tgg tgt ctg age aag 1443 

Gin Cys Lys Tyr Gly Glu Val Cys Ser Glu Leu Trp Cys Leu Ser Lys 

46 0 46 5 4 70 

35 

age aac egg tgc ate ace aac age ate ccg gec gee gag ggc aeg ctg 14S1 

Ser Asn Arg Cys He Thr Asn Ser lie Pro Ala Ala Glu Gly Thr Leu 

475 480 465 

40 tgc cag aeg cac acc ate gae aag ggg egg tgc tac aaa egg gtc tgt 1533 
Cys Glr. Thr His Thr lie Asp Lys Gly Trp Cys Tyr Lys Arg Val Cys 
493 495 50 0 5 05 

gtc ccc ttt ~gg teg ege tea gag ggt gtg gae gga gec tgg ggg ccg 15" 

4 5 Val Pre Fee Gly Ser Arg Frc Glu Gly Val Asp Gly Ale Trp Gly ?r: 

5 : C 515 5 2 0 

teg act era tgg gge ^: tgc age egg vec tg- ggc gge gg: gtg tee If 3 ' 
Trp Thr Frc Trp Gly Asp Cys Ser Arg Thr Cys C-ly Gly Gly 7 a 1 Ser 
51 525 5^C 5 35 

tct tct agt cgt cac tgc gae age ccc agg eca ace ate ggg ggc aag 16 5 3 
Ser Ser Ser Arg His Cys Asp Ser Pre Arg Pro Thr He Gly Gly Lys 
540 545 55C 

tac : gt ctg ggt gag jga a g g egg cui cgc t ;• tgc ecu aeg gut g a c 1 " 3 1 
Tyi Cys Leu Gly Glu Arc Arg Arg His Arg Ser Cys Asr. Thr Asp Asp 
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gga ggg ggc gtg aag gee tgc teg etc acg age eta gcg gas ggc tte 1375 

Gly Gly Gly Val Lys Ala Cys Ser Leu Thr Ser Leu Ala Glu Gly Fhe 

605 610 515 

5 

aac tte tac acg gag agg gcg gca gee gtg gtg gac ggg aca eec tgc 1523 

Asn Phe Tyr Thr Glu Arg Ala Ala Ala Val Val Asp Gly Thr Fro Cys 

620 525 630 

1C cgt cca gac acg gtg gac att tgc gtc act ggc gaa tgc aag cac gtg 1911 

Arg Pro Asp Thr Val Asp lie lys Val Ser Gly Glu Cys Lys His Val 

635 640 645 

ggc tge gac cga gtc ctg ggc tec gac ctg egg gag gac aag tgc cga 2019 

15 Gly Cys Asp Arg Val Leu Gly Ser Asp Leu Arc Glu Asp Lys Cys Arg 

650 £55 66 0 €65 

gtg tge ggc ggt gac ggc agt gec tge gag acc ate gag ggc gtc ttc 2C5 7 

Val Cys Gly sly Asp Gly Ser Ala Cys Glu Thr He Glu Gly Val Phe 

20 670 €75 SBO 

age cca gec tea cct ggg gec ggg tac gag gat gtc gtc tgg att ccc 211b 

Ser Pre Ala Ser Fro Gly Ala Gly Tyr Glu Asp Val Val Trp He Fro 

S35 690 6S5 

25 

aaa gge tec gtc cac ate ttc ate cag gat ctg aac etc tct etc agt 2163 

Lys Gly Ser Val His He Phe He Gin Asp Leu Asn Leu Ser Leu Ser 

700 705 * 710 

30 cac ttg gec ctg aag gga gac cag gag tee ctg ctg etg gag ggg ctg 2IU1 

His Leu Ala Leu Lys Gly Asp Gin Glu Ser Leu Leu Leu Glu Gly Leu 

715 720 72 5 

cct ggg acc ecc cag ccc cac cgt ctg cct eta get ggg acc acc ttt 2253 

3 5 Pro Gly Thr Pre Gin Pre His Arg Leu Pre Leu Ala Gly Thr Thr Phe 

730 ' 735 740 745 

caa ctg cga cag ggg eca gac cag gtc cag age etc gaa gec etc gga 230"" 

Gin Leu Arg Gin C-ly Pre Asp Gin Val Gin Ser Leu Glu Ala Leu Gly 

40 750 755 760 

ccg att aat gca tct etc ate gtc atg gtg ctg gec egg acc gag ctg 2355 

Fro Tie Asn Ala Ser Leu lie "al Met Val Leu Ala Arg Thr Glu Leu 

755 77 0 " > "'E 

45 

cct gc: etc cgc tac ege ttc aat gee ccc ate gec cgt gac teg ctg 2 103 

Fie Al s Leu Arg Tyr Arg Phe Asn Ala Pre He Ala Arg Asp Ser Leu 

"3: n ? e " : 

5, e-j cc; tue tee :gg eae tat geg cue tug aec aag tg_ teg gee e-g; ~ 4 - 1 

Pro Pre Tyr Ser Trp His Tyr Ala Pre Trp Thr Lys Cys Ser Ala Gin 

795 3C0 8C5 

tge gca ggc ggt age cag gtg cag gcg gtg gag tge ege aac cag etg 24 9 9 

55 Cys Ala Gly Gly Ser Gin Val Gin Ala Val Glu Cys Arg Asn Gin Leu 

F 1 0 - 1 c 5 2 2 = 2 5 
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Val Val C-ly Asr. Trp Ser Leu Cys 3er Arc Ser Cys Asp Ala Gly Val 
gfcO 365 " 37 C 

cgc agt cgc teg gtc gtg cag cgc cgc gtc cct gec gcg gag gag 2z?l 

5 Arg Ser Arg Ser Val Val Cys 31 r. Arg Arc Val Ser Ala Ala Glu C-lu 
3 75 c SO 8 SB 

aag gcg ctg gac gac age gca tgc ccg cag ccg cgc cca cct gta ctg 2735 
Lye Ala Leu Asp Asp Ser Ala Cys ?ry Gin Pro Arg Pro Pro Val Leu 
10 690 895 500 905 

gag ccc tgc cac cgc ccc act tgc cct ccg gag egg gcg gec etc gac 27£7 

Glu Ala Cys His Gly Pre Thr Cys ?r: Pro 31u Trp Ala Ala Leu Asp 

910 915 92C 

15 

egg tct gag tgc acc ccc age tgc gcg ccg 3gc etc cgc cac cgc gtg 2635 

Trp Ser Glu Cys Thr Pre Ser Cys Gly Pro 31 y Leu Arg His Arg Val 

925 93: 935 

20 gtc ctt tgc aag age gca cac cac egs gec acg ctg ccc ccg gcg cac 2863 
Val Leu Cys Lys Ser Ala Asp His Arg Ala Thr Leu Pre Pro Ala His 
940 945 950 

tgc tea ccc gec gee aag cca ecg gc: acc acg cgc tgc aac ttg cgc 2S31 
2 5 Cys Ser Pro Ala Ala Lys Pro Pro Ala Thr yet Arg Cys A3n Leu Arg 
955 96 3 965 

cgc tgc ccc ccg gec cgc tgg gtg gee ggc gag tgg ggt gag egc :ct 2573 
Arg Cys Pro Pre Ala Arg Trp Val Ala Gly C-lu Trp Gly Glu Cys Ser 
30 570 975 38C 9S5 

cca cag tgc ggc gtc ggc cag egg cag cgc ecg gtg cgc tgc ace acc 3C2 7 

Ala Gin Cys Gly Val Gly Gin Arg Gin Arg Ser Val Arg Cys Thr Ser 
590 995 1C00 

35 

cac acg ggc cag gcg teg cac cag tgc acg gag gec ctg egg ccg ccc 30~5 

His Thr C-ly Gin Ala Ser His Glu Cys Thr C-lu Ala Leu Arg Pre Pro 
1005 1C10 1015 

40 acc acc cag cag egt gag gee aag tgc gac age cca acc ccc ggg gac 3123 
Thr Thr Gin Gin Cys Glu Ala Lys Cys Asp Ser Pre Thr Pro Gly Asp 
102 0 1C2 5 103 0 

ggc cct gaa gag egc aag gat gtg aac aag gtc gec eac egc ccc etc .;. 1~1 
4 5 Gly Pre Glu Glu Cys Lys Asp Val Asr. Lys ,; il Ala Tyr Cys Pro Leu 
" 103 5 104 0 1C45 

etc eec ^aa ett cag tec tgc age :ga gee tac tee eye cag atg tgc ,-119 
Val Leu Lys fee Glr. Fhe Cys Ser Arg Ala Tyr fhe Arc Sir. :<et Cys 

5: i;=.o 10==' . .-. : 

egc aaa ace tgc cag ggc cac eagggggege geggcaeecg g^gecaeage } 2 7 ? 

Cys Lys Tnr Cys Gin Gly His 

55 

-j j -Jy:- tcogccgcrii gccctgcaje gg gccygcca aagggggecc cggjjygg- : ::3: 



WO HI 



<4 0H 2 4 

y.ez Ser Ser Cys Frc Val Trp Arc Ala Met Arg Ser Pre 5er Pre Frc 
L ' 5 1C 15 

5 

Ala Trp Thr Thr Thr Gly His Cys Trp Pro Ser Arg His Leu Leu Frc 
20 25 3C 

Gly Ala Ala Frc Arg His Gly Gly His Ser Arg Val Pro Fro Leu Leu 
10 3 5 '40 ~ 4 5 

Gin Ser Gly Leu Ala Ser Thr His Phe Leu Leu Asn Leu Thr Arg Ser 
50 * 55 60 

15 Ser Arg Leu Leu Ala Gly Arg Val Ser Val Glu Tyr Trp Thr Arg Glu 
55 75 53 

Gly Leu Ala Trc Sin Arq Ala Ala Arq Pro His Cvs Leu Tyr Ala Gly 
65 SO 95 

2C 

His Lou Gin Gly Gin Ala Ser Ser Ser His Val Ala He Ser Thr Cys 
100 10 5 11C 

Gly Gly Leu His Gly Leu He Val Ala Asp Glu Glu Glu Tyr Leu He 
25 115 120 125 

Glu Pre Leu His Gly Gly Pre Lys Gly Ser Arg Ser Pre Glu Glu Ser 
130 135 140 

3 0 Gly Pre His Val Val Tyr Lys Arg Ser Ser Leu Arg His Pro His Leu 
14 5 150 155 16 0 

Asp Thr Ala Cys Gly Val Arg Asp Glu Lys Pro Trp Lys Gly Arg Pre 
165 " 17C 175 

35 

Trp Trp Leu Arg Thr Leu Lys Fro Pro Pro Ala .Arg Pro Leu Gly Asn 
:ao 135 isc 

Glu Thr Glu Arg Gly Gin Pro Gly Leu Lys Arg Ser Val Ser Arg Glu 

40 195 20 0 2C5 

Arg Tyr Val Glu Thr Leu Val Val Ala Asp Lys Met Kez Val Ala Tyr 
210 215 22 0 

45 V.LH C-ly Arg Arg Asp Val Jl . Glr. Tyr Val Leu AH lie "e: Asn He 
22 5 H C 23: 1 4 J 

Val Al a Lys Leu ] r.e Gl :: A;:: Ser Ser Leu 31 y S e r Thr Va 1 A sr. I 1 * 
2^5 ~ 25; * 255 

51 

Leu Val Thr Arg Leu He Leu Leu Thr Glu Asp Gin Pre Thr Leu Glu 
2 5 0 2 6 5 2 "7 0 

He Thr His His Ala Gly Lys Ser Leu Asp Ser Fhe Cys Lys Trp Gin 
55 275 2H 265 

Lys Ser He Val Asn His Ser Gly His Gly Asn Al 2 He Pre Glu Asn 
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:■ 4 c 345 2 5 0 

Ala Ala Thr £e: Val His His Cys His Glu He Gly His Tnr ?r.e Gly 
3 5 5 : 6 C :• 5 5 

5 

>:et Asr. His Asp Gly Val Gly Asr. Ser Cys Gly Ala Arg Gly Glr. Asp 
370 375 * 363 

Pro Ala Lys Leu Met Ala Ala His He Thr Met Lys Thr Asn Frc Phe 
10 385 390 395 400 

Val Trp Ser Ser Cys Asn Arg Asp Tyr He Thr Ser Phe Leu Asp Ser 
405 410 415 

15 Gly Leu Gly Leu Cys Leu Asn Asn Arg Pro Pro Arg Gin Asp Phe Val 
420 425 43 0 

Tyr Pro Tnr Val Ala Pro Gly Gin Ala Tyr Asp Ala Asp Glu Gin Cys 
435 44C 445 

20 

Arg Phe Gin His Gly Val Lys Ser Arg Gin Cys Lys Tyr Gly 31 u Val 
450 455 460 

Cys Ser Glu Leu Trp Cys Leu Ser Lys Ser Asn Arg Cys He Thr Asn 
25 465 470 475 430 

Ser He Fro Ala Ala Glu Gly Thr Leu Cys Gin Thr His Thr He Asp 
4E5 490 435 

3 0 Lys Gly Trp Cys Tyr Lys Arg Val Cys Val Frc Phe Gly Ser Arg Pro 

500 505 51C 

Glu Gly Val Asp Gly Ala Trp Gly Pro Trp Thr Fro Trp Gly Asp Cys 
515 520 52E 

35 

Ser Arg Thr Cys Gly Gly Gly Val Ser Ser Ser Ser Arg His :ys Asp 
530 ' 535 540 

Ser Pro Arg Fro Thr lie Gly Gly Lys Tyr Cys Leu Gly Glu Arg Arc 
40 545 " 550 555 £60 

Arg His Arg Ser Cys Asn Thr Asp Asp Cys Pre Fro Gly Ser Hn Asp 
565 570 575 

4 5 Phe Arg 31u V?L1 Gin lys Ser Gl j Phe Asp 5cr He Phe Arg Gly 

5 = 0 555 59-: 

lys Tyr Lys Trp Lye Tnr Tyr Arg Giy Ply Gly VH Lys Ha "/. 

5 53 HC H5 

Ser Leu Tnr Ser Leu Ala Glu Gly Phe Asn Fhe Tyr Thr Glu Arg Ala 
610 6 15 62 0 

Ala Ala Val Val Asp Gly Thr Pre Cys Arg Frc Asp Thr Val .Asp lie 

5 5 £25 t3C 635 CAZ 

lys Vai Ser Gly Glu Cys Lys His Val Gly Cys Asp Arg Val Leu Gly 
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lie Glr. Asp Leu Asr leu Ser Leu Ser His Leu Ala Leu Lys Gly Asp 
^35 710 715 720 

5 

3ln Glu Ser Leu Leu Leu Glu 31/ Leu Pro Gly Thr Fro Glr. Fro His 
72= 730 735 

Arg Leu Pro Leu Ala Gly Thr Thr Phe Glr. Leu Arg Gin Gly Pro Asp 
10 740 ' 745 750 

Glr. val Gin Ser Leu Glu Ala Leu Gly Pro He Asn Ala Ser Leu He 

755 76C 765 

15 Val Men Val Leu Ala Arg Thr Glu Leu Pro Ala Leu Arg Tyr Arg Phe 
770 "75 7iC 

Asr. Ala Pro He Ala Arg Asp Ser Leu Pre Pro Tyr Ser Trp His Tyr 
735 790 755 iZ'j 

2C 

Ala Pro Trp Thr Lys Cys Ser Ala Gin Cys Ala Gly Gly Ser Gin Val 
605 810 315 

Gin Ala Val Glu Cys Arg Asn Gin Leu Asp Ser Ser Ala Val Ala Pro 
25 82C 825 830 

His Tyr Cys Ser Ala His Ser Lys Leu Pre Lys Arg Gin Arg Ala Cys 
835 340 845 

3 0 Asn Thr Glu Pro Cys Pro Pro Asp Trp Val Val Gly Asn Trp Ser Leu 

650 855 &60 

Cys Ser Arg Ser Cys Asp Ala Gly Val Arg Ser Arg Ser Val Val Cys 
855 570 675 =30 

35 

Gin Arg Arg Val Ser Ala Ala Glu Glu Lys Ala Leu Asp Asp Ser Ala 
395 £90 395 

Cys Pro Glr. Pro Arg Pro Pro Val Leu Glu Ala Cys His Gly Pro Thr 
43 900 905 91C 

Cys Fro Pro Glu Trp Ala Ala Leu Asp Trp Ser Glu Cys Thr Pro Ser 
915 " 920 925 

4 5 Cys Gly Fro Gly Leu Arg His Arg Val Val Leu Cys Lys Ser Ala Asp 

s 3 :■ * 3 5 4 c 

Hi? Aiq Ala Thr Leu r i: Pi- Al = H : :- Cys £~i L:: Ala A. a Ly~ Fro 
r4= ft; :-=5 i5J 

= C : 

Fro Ala Thr Men Arg Cys Asn Lei. Arg Aig Cys ?::c Pro Ala Arg Trp 

3 5 5 5 n : ? 7 5 

Val Ala Gly Glu Trp Gly Glu Cys Ser Ala Gin Cys Gly Val Gly Gin 

55 ?EC 3 -.5 

Aig Gli. Arg Ser V^l Arg Cys Ser H 1 s 7 hi Gly Glr. Ala St=: His 
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Ser Arc Ala Tyr Fhe Arg 

; c s j 



In. Xec Cys Cys Lys 
IC6 5 



ir Cys Glr. Gly His 
107C 



<210> 2 5 

<211> 5990 

10 <:i2> DNA 

<213> Kot.0 sapiens ADAPTS- 3fc 

< :: 2 0 > 
<221> CDS 
15 <222> (33) . . ;5534 ) 

<400> 25 

tsqqcccaqc ccaqcq aqac qcqccaagca cc atg cac tct gta ccc tgg gec 53 

Met Glr. Phe Val Ser Trp Ala 
20 15 

aca ccg eta acg etc ctg gtg egg gac ctg gec gag atg ggg age cca 101 

Thr Leu Leu Thr Leu Leu Val Arg Asp Leu Ala Glu Met Gly Ser pre 

10 15 20 

25 

gac gee geg gcg gec gtg cgc aag gac acg etc cac ccg acq caa gtg 149 

Asc Ala Ala Ala Ala Val Arg Lys Asp Arg Leu His ?rc Arg Gin Val 

25 3-! * 35 

30 aaa tta tta cac acc ctg age gaa tae gaa ate gtg tct ccc ate cga 157 
Lys Leu Leu Glu Thr Leu Ser Glu Tyr Glu He val Ser Pro He Arg 
4 0 4 5 50 5 5 

gtg aac get etc gga gaa ccc ttt ccc acg aac gtc cac tte aaa aga 245 
35 Val Asn Ala Leu Gly Glu Pro Fhe Pro Thr Asr. Val His Phe Lys Arg 

60 6 5 7C 

acg cga egg age act aac tct gec act cac ccc tgg cct gec tte ccc 293 
Thr Arg Arg Ser He Asn Ser Ala Thr Asp Pro Trp Pre Ala Fhe Ala 
40 " "7= 6 0 65 

tec tec tct tec tec tct ace tec ccc cag gec cat tac cge etc tct 341 

Ser Ser Ser Ser Ser Ser Thr Ser Pre Gin Ala His Tyr Arg Leu Ser 

9 C 9 5 1 C ; 

■i = 

gec tte gge cac eag ttt eta ttt aat etc arc gee aa: ge: gga ttt HJ 

Ala Phe Gly Gin Gin Phe Leu Phe A~n Leu Thr Ala Asr. Ala Gly Phe 

1 T r 1 : . ". 1 c 

ul ate get tea ctg tic act gtc c*ec etc etc ggg ccg cc. ggg gtg a .-it 
He Ala Pre Leu Fhe Thr Val Thr Lej Leu Gly Thr Pre Gly Val Asn 
123 125 13C 13d 



cag acc aag ccc tat tee gaa gag gaa ccg gaa etc aag cac tgt tte 
55 31n Thr Lys Fee Tyr Ser Glu Glu Glu Ala Glu Leu Lys His Gye Fhe 
1 4 J 1 4 E 1:1 
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Phe He Glu ?rc Leu 31 n Se: Xe: Asp 31 u 31r. Glu Asp Glu Glu 31 u 

195 15C 195 

:aa aa: aaa :cc cac ate ate :a: agg eg- age ccc :cc cag aga gag 577 

5 3ln Asn Lya Pre His He He Tyr Arc Arg Ser Ala Pre 31 r. Arg 31u 

20C 205 " 210 215 

ccc tea aca gga agg cat gca tgt gac acc tea gaa cac aaa aat agg 725 
Pro Ser Thr Gly Arg His Ala Cys Asp Thr Ser Glu His Lys Asn Arg 

10 220 225 23C 

cac agt aaa gac aag aag aaa acc aga gca aga aaa egg gga gaa agg 773 

Hie Ser Lys Asp Lys Lys Lys Thr Arc Ala Arg Lys Trp Gly Glu Arg 

235 24C 245 

15 

att aac erg get get gac gra gca gca tta aae age ggc era gca aea 821 

He Asn Leu Ala Gly Asp Val Ala Ala Leu Asn Ser Gly Leu Ala Thr 

2 5 0 25 5 26 0 

20 gag gca ttt tct get tat ggt aat aag aeg gae aac aca aga gaa aag 865 

Glu Ala Fhe Ser Ala Tyr Gly Asn Lys Thr Asp Asn Thr Arg Glu Lys 

265 270 " 275 

agg acc eac aga agg aea aaa cgt ttt tta tee tat eca egg ttt gta 917 

2 5 Arg Thr His Arg Arg Thr Lys Arg ?he Leu Ser Tyr Pro Arg Phe Val 

2S0 235 230 295 

gaa gtc ttg gtg gtg gea gac aac aga a eg gtt tea tac cat gga gaa ?£5 

Glu Val Leu Val Val Ala Asp Asn Arg Met val Ser Tyr His Gly Glu 

30 3 00 3 05 310 

aae ctt eaa eac tat art tta act tta atg tea att gta gee tct ate 1013 

Asn Leu Gin His Tyr He Leu Thr Leu Met Ser He Val Ala Ser He 

315 32C 325 

35 

tat aaa gac cea agt att gga aat tta att aat att gtt att gtg aac 1C£1 

Tyr Lys Asp Pre Ser He Gly Asn Leu He Asn He Val He Val Asn 

33 0 33 5 3 4 0 

4 0 tta att gtg att cat aat gaa cag gat ggg cct tec ata tct ttt aat 11C9 

Leu He Val He His Asn Glu Gin Asp Gly Fro Ser lie Ser Phe Asn 

3 4 5 3 5 0 3 5 5 

get cag aea aea tta aaa aac ttt tge eag tge cag cat teg aae agt HE" 7 

•»5 Ala Gin Thr Thr Leu Lys Asn Phe lys Gin Trp 31 n His Ser Asn Ser 

leO 36: 3H 3"5 

era ggt gga at z eat tat gat act get gtt et r tta aea aga eag .jat 11 : = 

Pre Gly Gly He His His Asp Thr Ala Val Leu Leu Thr Arg Gin Asp 

a j-c ■ zb ib-: 

ate tge aca gtt cat gae =aa tgt gat ace tta gge ctg get gaa etg 1253 

lie lys Arg Ala His Asp Lys Cys Asp Thr Leu Gly Leu Ala Glu Leu 

3 53 4 00 4 05 

^ - 

gga acc ~tt tgt gat tat aga age tgt tet att agt gaa gat agt HI". 

Gly Thr He Cys Asp Pro Tyr Arg Ser Cys Ser He Ser 31u .Asp S<;r 
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dat cc: cag cat gec arc get cca aca etc aa: ::c :ac ac: aac ccc I 4 4 5 

Ser Pro Gin His Val Met Ala Pre Thr Leu Asn Phe Tyr Tr.r Asn Fro 

462 465 4 ~ C 

5 

tgg a to tgg tea aag tgt agt cga aaa tat ate act gag ttt tta gac 142-3 

Trp Met Trp Ser Lys Cys Ser Arg Lye Tyr :le Thr Glu Phe Leu Asp 

475 48C 485 

10 act ggt Cat cgc gag tgt ttg ctt aac gaa cct gaa tec aga ccc tac 1541 

Thr Gly Tyr Gly Glu Cys Leu Leu Asn Glu Fro Glu Ser Arg Pre Tyr 

433 455 503 

cct ctg cct gtc caa ctg oca g?c ate ctt tac aac gtg aac aaa caa 15S5 

15 Pre Lee. Pre Val Gin Leu Pro Gly He Leu Tyr Asn Val Asn Lys Gln 

305 51J £15 

tgt gaa ttg att ttt gga cca g=t tec cag gtg tec cca tat atg atg 163" 

Lys ji j i,eu lie tr.e Gly Pro Gly Ser Gin Val Cys Pro Tyr Met Me 

20 523 525 530 535 

cag tgc aga egg etc tgg teg ast aac gtc aat gga gta cac aaa ggc 1685 

Gin Cys Arg Arg Leu Trp Ser Asn Asn Val Asn Gly Val His Lys Gly 

540 545 550 

25 

"9- egg act cag cac aca eee tgg gec gat ggg acg gag tgc gag cct 1733 

Cys Arg Thr Gin His Thr Pro Trp Ala Asp Gly Thr Glu Cys Glu Pro 

555 56C 56 5 

30 gga aag cac tgc aag tat gga ttt tgt gtt ccc aaa gaa atg gat gtc 1751 

Gly Lys His Cys Lys Tyr Gly Pne Cys Val Pro Lys Glu Met Asp Val 

57C ' 5^5 53C 



ccc gtg aca gat gga tec tgg g^a agt tgg agt ccc ttt gga ace tgc 
3 5 Pro Val Thr Asp Gly Ser Trp Gly Ser Trp Ser Pro Phe Gly Thr Cys 
58 5 5 90 555 



1625 



tec aga aca tgt gga ggc cgc ate aaa aca gec att cga cag tgc aac 16" 7 

Ser Arg Thr Cys Gly Gly Gly He Lys Thr Ala He Arc Glu Cys Asn 

4C 600 ^ 6 05 610 615 

aca cca gaa cca aaa aa: ggt cga aaa tsc tgt gta gga cgt aga atg 1S25 

Arg Pre- Glu Pro Lys Asn Gly Gly Lys Tyr Cys Val Gly Arg Arg Met 

€ 2 C 6 2 5 6 3 2 

4 5 

aaa ::: aag tee tgc aac a eg gag c::i tgt etc aag cag aag ega g^e 

Lys Phe Lys Ser Cys Asn Thr Glu Pre Cys Leu Lys Gin Lys Arg As:: 

• ■' •; 

'. - - 7 r r z -j - aii :i: - c : 7 :■ : en:" : *: j j 7 -. :-- ^ r " " r : r ■ ■■ a * e I " 2 '.. 

Phe Arg Asp Glu Gin Cys Ala His Phe Asr Gly Lys :-:is Pne Asn lie 

65 J * 6 55 6 6C 

aac ggt ctg ctt ccc aat gtg cgc tgg etc cct aaa tac agt gga att 2259 

E5 Asn Gl .■ Leu Leu Pre Asn Val Arg Tre Val Pro Lys Tyr Ser Gly He 

£65 6 7 c " 6^5 

-7 --* ~ 3 t 7 = - r " * 1 1 ": r t r. : ~ : :s 3 3 e aea I 1 H 
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Gin Asp Thr Asr. Asp lie Cys Val Glr. Gly Leu Cys Arg Gin Ala Gly 

715 * "2C ' 725 

cgc qa: :a: get cca aa: tea aaa gec egg aga gat aaa tgc ggg get 22£: 

5 Cys Asp His Val leu Asn Ser Lye Ala Arg Arg Asp Lys Cys 31y Val 

73? 7 35 74C 

tgt gat ggc gat aat tec tea tgc aaa a:a gtg gca gga a;a ttt aac 23C3 

Cys Gly Gly Asp Asr. Ser £er Cys Lys Thr Val Ala Gly Thr Phe Asn 

10 745 * 750 755 

aca gca cat tat ggt tac aac act gcg gtc cga att cca get ggt gec 2357 

Thr Val His Tyr Gly Tyr Asn Thr Val Val Arg He Pro Ala Gly Ala 

7£C 765 770 775 

15 

ace aac ate cat gtg egg cag cac agt etc tea ggg gaa aca gac gac 24C5 

Thr Asn He Asp Val Arg Gin His Ser Phe Ser Gly Glu Thr Asp Asp 

750 7=5 750 

20 gac aac tac tta get tea tea age agt aaa ggc gaa tec ttg eta aat 24E3 

Asp Asn Tyr Leu Ala Lej Ser Ser Ser Lys Gly Glu Phe Leu Lei: Asn 

735 80C B05 

gga aac ttt gtt gtc aca atg gee aaa acg gaa ate cgc att ggg aac 25C1 

2 5 Gly Asn Phe Val Val Thr Xec Ala Lys Arg Glu He Arg lie Gly Asn 

310 515 620 



get gtg gca gag tac agt ggg tee gag acc gec gta gaa aga att aac 
Ala Val Val Glu Tyr Ser Gly Ser Glu Thr Ala Val Glu Arg lie Asn 
30 825 830 835 



2543 



9 7 



tea aca gat cgc att gag caa gaa ctt ccg etc cag gtc ttg ccg gcg 

Ser Thr Asp Arg He Glu Gin Glu Leu Leu Leu Gin Val Leu Ser Val 

e 4 G 845 350 S55 

35 

gga aag ccg tac aac ccc gat gca cgc cat tct ttc aac ate cca act 2645 

jly Lys Leu Tyr Asn Pre Asp Val Arg Tyr Ser Phe Asn He Pre lie 

SEC 5s5 

40 gaa gat aaa ccc cac cag ttt cac egg aac age cat egg cca egg caa 2633 

Glu Asp Lys Pre Gin Gin Phe Tyr Trp Asn Ser His Gly Pro Trp Gin 

875 S80 335 

jca tgc agt aaa ccc tgc :aa ggg gaa egg aaa cga aaa cct gtc cgc 2741 

4 5 Ala Cys Ser Lys Pre Cys Gin Gly Glu Arg Lys Arg Lys Leu Val Cys 

? 9 C c 5 5 ? C C 

see agg gaa ccc gat eag etc act gtc cct gae eaa aga tgc ;ac egg 

Arg Glu Ser Asp He. Leu Thr Val Ser Asp Gin Ar g Cys Asp Arg 

f: : 'r : 5 c l ': ? '. 5 

:tg ccc cag ccc gga cac ate acc gaa ccc tgc ggc aca ggc tge. gac 2 33" 

Leu Pre Gin Pro Gly His He Thr Glu Pre Cys Gly Thr Gly Gys Asp 

S2C ' 52 B 935 

55 

ctg agg egg cat get gec age agg age gaa tgt age gee cag tgc ggc 2 S 3 5 

Leu Arg Trp His Val Ala Ser Arg Ser Glu lys Ser Ala Gin Cys Gly 

r - 4 " G 4 r r ^ " 
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aaa ;:a age aac egt 53a aaa tgt tea ggg gaa :ct aac acg ggt ggt 3H9 

Lys ?rc Ser Asn Arg Glu Lys Cys Ser Gly Glu Cys Asn Thr Gly Gly 

935 990 955 

5 

tgg cgc tat tct gec tgg act gaa tgt tea aaa age tgt gac ggt ggg 3077 

Trp Arg Tyr Ser Ala Trp Thr Glu Cys Ser Lys Ser Cys Asp Gly Gly 

1CC0 ' 1005 lDIO 1315 

10 ace eag agg aga agg get att tgt gtc aat aec ega aat gat gta ctg :i25 

Thr Gin Arg Arc Arg Ala He Cys Val Asn Thr Arg Asn Asp Val Leu 

"l02C 1025 1030 

gat gac age aaa tgc a;a cat eaa gag aaa jtt ace att tag agg tgt 1-173 

15 Asp Asp Ser Lys Cys Thr His Gin Glu Lys Val Thr He 3ln Arg Cys 

1035 10H i:45 

agt gag ttc cct tgt eea cag tgg aaa tet ^ga gac tgc tea gag tgt 1221 

5ei 31 u ?i;c ?:u Cys Pi- Gl.i Hp Lys 3ci 31/ Asp Trp Ser Glu Cys 

20 1050 1055 1060 

ttg gtc ace tgt gga aaa ggg tat aag eac tgc cag gtc tgg tgt tag .2£9 

Leu Val Thr Cys Gly Lys Gly His Lys His Arg Gin Val Trp Cys 31 r. 

105 5 1C7 0 i:7 5 

25 

ttt ggt gaa gat ega tta aat gat aga atg tgt gac ect gag acc aag 1317 

Phe Gly Glu Asp Arg Leu Asn Asp Arg Met Cys Asp Fro Olu Thr Lys 

1C6 C "l085 1 C90 1395 

30 eea aca tct atg cag act tgt cag cag eeg gaa atg gca tee tgg raj 1-365 

?rc Thr Ser Ket Gin Thr Cys Gin Gin Fro 3lu Met Ala Ser Trp 31n 

HOC 1105 111C 



gcg ggt ecc tgg gta cag tgc agt gtc act tgt gga cag gga tac tag 
35 Ala Gly Fro Trp Val Gin Cys Ser Val Thr :ys Gly Gin Gly Tyr 31r. 

1115 112C 1125 



:- 4 : 3 



eta aga gca gtg aaa tge ate att ggg act tat atg tea gtg gta gat :-4£l 

Leu Arg Ala Val Lys Cys He lie Gly Thr Tyr Ket Ser Val Val Asp 

40 1133 1135 114C 

gac aat gac tgt aat gta gca act aga eea act gat acc tag gac tgt 35C9 

Asp Asn Asp Cys Asn Ala Ala Thr Arg Pro :hr Asp Thr Gin Asp Cys 

1145 he: he 5 

- 

gaa tta eea tea tgt eat cct tee tea get eeg gaa a eg agg ica " - 

Glu Leu F:o Ser Cys His Fro Pre Pre Ala Ma Pre Glu Thr Arg \rc 

: 1 6 j 1 : £ b 1 1 "j n ~ = 

i 1 age a;- a t a :: igt jta tta aga :.;t .u;: tgg :ga ttt jgg r :: t tgg 

Ser Thr Tyr Ser Ala Frt Arg Thr Glr. Trp Arg Che Gly Ser Trp Chr 
116C 1135 1190 

eta tgc tea gtc act tgt ggg aaa ggt aec tgg atg aga tat gte age 3£53 

5 5 Pre Cys Ser Ala Thr Cvs Gly Lys Gly Thr Arg Met Arg Tyr Val Ser 
1:95 ' ::c: 

-~r "a rra 1 : ~ar: a a . t tf <~.<- agt gte tgt .trer a~t 
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Glr. Tip Lys Ala Leu Asp Trp Ser Ser Cys Ser v c 1 Thr Cys 01 y Glr. 
1Z4: 1245 125C 1255 

ggc agg qca acc egg caa gig ate tgt gtc aac :ac agt gac cac gtg 3 34 5 

5 Gly Arg Ala Thr Arg Gin val Met Cys Val Asn Tyr Ser Asp His val 
1260 * 1265 1270 

ate gat egg agt gag tgt gac cag gat tat ate eca gaa act gac cag 3593 

lie Asp Arg Ser Glu Cys Asp Gin Asp Tyr He Pro Glu Thr Asp Gin 
10 1275 1280 1285 

gac tgt cc: atg tea cea cgc ect caa agg ace eca gac agt ggc tta 2S41 

Asp Cys Ser Y.et Ser Pre Cys Fro Glr. Arg Thr Fro Asp Ser Gly Leu 
1290 1255 13C0 

15 

get cag cac ccc ttc caa aa: gag gac tat egt ccc egg age gec age 593? 

Ala Gin His Pro Phe Gin Asn Glu Asp Tyr Arg Fro Arg Ser Ala Ser 

13 05 1310 1315 

2 0 ccc age cgc aec eat gtg etc ggt gga aac cag egg aga act ggc ccc 4C37 

Pro Ser Arg Thr His Val Leu Sly Gly Asn Gin Trp Arg Thr Gly Pro 
1323 1325 * 1330 1335 

egg gga gca tgt tee agt acc tgt g;t cgc gga tec cag egg cgc gtt 4CS5 

2 5 Trp Gly Ala Cys Ser Ser Thr lys Ala Gly Gly Ser Gin Arg Arg Val 

1340 1345 135 0 

gtt gta tgt cag gat gaa aat gga tac acc gca aac gac tgt gtg gag 4133 

Val Val Cys Glr. Asp Glu Asn :-ly Tyr Thr Ala Asn Asp Cys val Glu 

3 0 13 55 13 6 0 13 6 5 

aga ata aaa ccc gat gag caa aga gee tgt gaa tec ggc cct tgt ccc 4 131 

Arg He Lys Pro Asp Glu Gin Arg Ala Cys Glu Ser Gly Pro Cys Pro 
1370 13^5 1330 

35 

cag tgg get tat ggc aac tgg gga gag tge act aac ctg tgt ggc gga 4229 

Gin Trp Ala Tyr Gly Asn Trp Sly Glu Cys Thr Lys Leu Cys Gly Gly 

1385 1390 1395 

40 ggc ata aga aea aga ctg gtg gtc tgt cag egg tec aac ggt gaa egg 4-27: 1 

Gly He Arg Thr Arg Leu Val Val Cys Gin Arg Ser Asn Gly Glu Arg 
1400 1405 141C 1415 

ttt eca gat ttg age tgt gaa ate ett cat aaa cct ccc gat egt gag 4323 

45 rhe Pre. Asp Leu Sar Cys Glu He Leu Asp Lys f ro Ire Asp Arg 31 u 
14 2 0 142 5 14 3 1 

c_g tgt aac aca eat get tgt eca eae gar get gca tgg agt at: ggc 4 ; ~ > 

Gin Cys Asr. Thr His Ala lys Pro His Asr Ala Ala Trp Ser Thr Gly 
;j 14H 144C 144; 

cct tgg age teg tgt tct etc tet tgt ggt ega ggg cat aaa caa ega 4 42 - 

Pre Trr Ser Ser Cys Ser Val Ser Cys Gly Arg Gly His Lys Glr. Arg 
14 5 0 14 5 5 14 6 0 

S 5 

aat gtt tac tgr atg gea 33.* gat gga sgc eat tta gaa agt gat tac 44-1? 

Asr. Val Tyr Cys yet Ala Lys Asp Gly Ser His Leu Glu Ser Asp Tyr 
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cgc ggc cga ggc gea rag cag agg g:g ggc cgt cag ate gga aca 4612 

Cys C-ly Arg Giy Val Gin Gin Arc His Va 1 Gly Cys 31 r. He Gly Thr 
1515 1520 1525 

5 

cac aaa ata gec aca gac acc gag cgc aac cca tac acc aga ccg gag 4661 
His Lys He Ala Arg Asp Thr Glu Cys Asn Fro Tyr Thr Arg Pro Glu 
1530 1535 1540 

10 ccg gaa tgc gaa cgc caa gec cca egg tgc ccc ctt cac acc tgg agg 4709 
Ser Glu Cys Glu Cys Gin Giy Fro Arg Cys Pre Leu Tyr Thr Trp Arg 
1545 1550 1555 

gca gag gaa ccg caa gaa cgc acc aig acc tgc ggc gaa ggc ccc agg 475" 
15 Ala Glu Glu Ser Gin Glu Cys Thr Lys Thr Cys Gly Glu Giy Ser Arg 
156C 1555 157G 1575 

cac cgc aaq acq qtq tqc gtg qac qac aac aaa aac qaq qtq cac qqq 4305 
Tyr Arg Lys Val Vai Cys Val Asp Asp Asn Lys Asn Glu Val His Gly 
20 159C 1535 159C 

gca cgc Cgc gac gtg age aag egg ccg gcg gac cgt gaa age cgc age 4553 

Ala Arg Cys Asp Val Ser Lys Arg Pro Val Asp Arc Glu Ser Cys Ser 
1595 1603 1605 

2 5 

ccg caa ccc cgc gag tac = tz egg acc aca gga gaa egg tea gag cgc 4901 

Leu Gin Pr; Cys Glu Tyr Val Trp lie Thr Gly Glu Trp Ser Glu Cys 
1613 1615 1620 

30 cca gtg acc cgt gga aaa 5gc cac aaa eaa agg cct gec teg cgc age 4345 
Ser Val Thr Cys Gly Lys 3-ly Tyr Lys Gin Arg Leu Val Ser Cys Ser 
1625 1630 1635 

gag att cac acc ggg aaa gag aac cat gaa cac age cac eaa acc acc 4997 

3 5 Glu lie Tyr Thr Gly Lys 31u Asn Tyr Glu Tyr Ser Tyr Gin Thr Thr 

164C 1645 1650 1555 

acc aae tgc cca ggc acg cag ccc ccc agt gtt cac ccc tgt cac ctg 5C4 5 
He Asn Cys Pro Gly Thr Gin Pro Pro Ser Val His Pro Cys Tyr Leu 
4C 16 6 0 1665 167C 

agg gag tgc cct gtc teg gec acc egg aga gtt ggc aac egg ggg age 5053 
Arg Glu Cys Pre Val Ser Ala Thr Trp Arg Val Sly Asn Trp Gly Ser 

:675 is;;: 1 6 3 5 

cgc era gca ere eg: ggc gtt gga geg acg rag aga ccc gtg caa cgc E14 1 
I'ys Ser Val Ser rye Gly Va 1 Giy Vai Moe Jin Arg Ser Val Gin Cys 
It*: i^cc 

iii c:a sec aac gag gac eaa ccc age cac cca c g r :ac act gac ccg aag : ■ - 
Leu Thr Asn Glu Asp Gin Pre Ser His Leu Cys His Thr Asp Leu Lys 
1"j5 17i: 1715 

cca gaa gaa ega aaa acc cgc cgc aae gec tac aac cgt gag tea ccc 52 3 ~ 
55 Pre Glu 31 u Arg Lys Thr Cys Arg Asn '-'si Tyr Asr. Cys Glu Leu Pre 
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Gly .Vet His £e: Asp His Pre Lys Glu Tyr Val Thr Leu Val His Gly 
177; 17 75 17 gC 

gat tct gag ddt :cc gag gtt :a: ggg cac agg era :a: aa: cca 

5 Asp Ser Glu Asr. Phe Ser Glu Val Tyr Gly His Arc Leu His Asr. Pro 
1735 1790 17S5 

aca gaa tgt c:c rat aac ggg age egg cgc gat gac tgc caa tgt egg 5477 
Thr Glu Cys Pre Tyr Asr. Gly Ser Arc Arg Asp Asp Cys Gin Cy3 Arg 
10 180C 1805 1610 1815 

aag gat tac acg gec get ggg ttt tec agt ttt cac aaa ate aca ata 5525 

Lys Asp Tyr Thr Ala Ala Cly Phe Ser Ser Phe Gin Lys lie Arg lie 
1=20 1925 1633 

15 

ga : etc acc age atg cag ata ate ace act gac tta cag ttt gca agg 5572 

Asp Leu Thr Ser Met Gin He lie Thr Thr Asp Leu Gin Phe Ala Arg 
1835 184C 1845 

20 aci age gaa gga cat ccc :tc cct ttt gec aca gee ggg gat tgc tac 5621 
Thr Ser Glu Gly His Frc Val Pro Pne Ala Thr Ala Gly Asp Cys Tyr 
1850 1855 1S6C 

igc get gec aag tgc cca tag ggt cgt ttt age ate aac ctt tat gza 5££9 
25 Ser Ala Ala Lys Cys Frc Gin Gly Arg Phe Ser He Asr. Leu Tyr Gly 
1355 1370 18^5 

acc ggc ttg tct tta act gaa tct gec aga tgg ata tea caa ggg aat 571"' 
Thr Gly Leu Ser Leu Thr Glu Ser Ala Arg Trp lie Ser Gin Gly Asn 
30 1880 1385 1B90 1855 

tat get gtc tct gac ate aag aag teg ccg gat ggt ace cga etc gta 5765 

Tyr Ala Val Ser Asp He Lys Lys Ser Pro Asp Gly Thr Arg Val Val 
19CC 1905 1910 

3 5 

ggg aaa tgc ggt ggt tae tgt gga aaa tgc act cea tee tct ggt act 5 6 1 n 

Gly Lys Cys Gly Gly Tyr Cys Gly Lys Cys Thr Pre Ser Ser Gly Thr 
1515 152C 1925 

4 0 gge erg gag gtg cga gtt tta tagctaaggt gctttgaaga ggaagecatt 5££4 

Gly Leu Glu Val Arg Val Leu 
153 0 

•jtggatggat gaaggatagt tiatgeaatae cteeacctta stttcgctgc atgtgtatgt =-92 4 

4 5 

^tgtgigt :;*_ ttgtgtgtga ct tgt .it cc 1 t-:t-}~gtgt^ -atgtgtgta >■ . ' . ; . 1 - . 
a t aca 

-:2 10> 2t 

<2 11> 193 4 
<212> rRT 

■ :213> Here sapiens ADAPTS -3c 

■. 4 : :i > * e 

!'et Gin Fhe Val Ser Trp Ala Thr Leu Leu Tr.r Leu L* u Val Arg Asp 
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50 55 6C 

Tr.r Asn Val His Fhe Lys Arg Thr Arg Arg Ser :le Asr. 5er Ala Tr.r 

cs 7: -5 =■: 

5 

Asp Fro Trp Fro Ala Fhe Ala Ser Ser Ser Ser Ser Ser Thr Ser Fro 
65 90 S5 

Gin Ala Kis Tyr Arg Leu Ser Ala Fhe Gly Gin Gin Fhe Leu Phe Asn 
10 IOC 105 110 

leu Thr Ala Asn Ala Gly Fhe lie Ala Fro Leu Phe Thr Val Thr Leu 
115 12C 125 

15 Leu Gly Thr Frc Gly Val Asn Gin Thr Lys Fhe Tyr Ser Glu Glu Glu 
13C 135 14C 

Ala Glu Leu Lys His Cys Phe Tyr Lys Gly Tyr Val Asn Thr Asn Ser 
L-it IbC 155 16C 

20 

Glu His Thr Ala Val He Ser Leu Cys Ser Gly Ket Leu Gly Thr Fhe 
165 17 0 175 

Arg Ser His Asp Gly Gly Tyr Phe He Glu Fro Leu Gin Ser Met Asp 
25 ISC 135 190 

Glu Gin Glu Asp Glu Glu Glu Gin Asr. Lys Fro His lie He Tyr Arg 
135 2 CO 2C5 

3 0 Arg Ser Ala Pro Glr. Arg Glu Fro Ser Thr Gly Arg His Ala Cys Asp 

210 215 22C 

Thr Ser Glu His Lys Asn Arc His Ser Lys Asp Lys Lys Lys Thr Arg 

225 * 23C " 23*5 " 24C 

35 

Ala Arg Lys Trp Gly Glu Arg He Asn Leu Ala C-ly Asp Val Ala Ala 

245 23C 255 

Leu Asn Ser Gly Leu Ala Thr Glu Ala Fhe Ser Ala Tyr Gly Asn Lys 
40 260 265 27 0 

Thr Asp Asn Thr Arg Glu Lys Arg Thr His Arg Arg Thr Lys Arg Phe 
2 7 5 2 3 0 2 f 5 

4 5 Leu Ser Tyr Fr: Arg Phe Val Glu Val Leu Val Val Ala A = p Asn Arg 

29: 2?z 3c: 

Met Vu.1 St.:r Tyr His Giy 3; u As:: leu Glr. = Tyr He leu Tr.r Leu 

:■ c 5 lie 315 

r -, 

Met Ser He Val Ala Ser He Tyr Ly = Asp Fro Ser He Gly ksn Leu 

3 2 5 3 3 C 3 3 5 

He Asr. He Val He Val Asr. Leu He Val He His Asr, Glu Gin Asp 
55 34 0 345 3- 5 C 

Gly pre Ser He Ser Phe Asn Ala Glr. Tr.r Thr Leu Lys Air. fne Cys 

-, rr: \ c ■ ~. c ~ 



WO Hi I M"4 P( I I SUM 21223 

SO 



4:5 4:0 415 

Cys Ser :le Ser Glu Asp Ser Gly leu Ser Thr Ala Fhe Thr He Ala 
4 2C 4 2E 43: 

5 

Kis Glu Leu Gly His Val F.ne Asr. Met Fro His Asp Asp Asr. Asr: Lys 
435 443 445 

Cys Lys Glu Glu Gly Val Lys Ser Pre Gin His Val Met Ala Fro Thr 
1C 45C 455 46C 

Leu Asn Fhe Tyr Thr Asn Pro Trp Met Trp Ser Lys Cys Ser Arg Lys 
455 4 70 475 4 6 0 

15 Tyr He Thr Glu Phe Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asr. 

465 4 r 0 495 

Glu Pro Glu Ser Arg Pro Tyr Pro Leu Pro Val Gin Leu Pro Gly lie 
: 0 0 5 0 5 d 1 0 

20 

Leu Tyr Asn Val Asr. Lys Sir. Cys Glu Leu He Fhe Gly Pro Gly Ser 
515 52C 525 

Gin Val Cys Pre Tyr Met Met Gin Cys Arg Arg Leu Trp Ser Asn Asn 
25 53C S35 540 

Val Asn Gly Val Kis Lys Gly Cys Arg Thr Gin His Thr Pro Trp Ala 
545 550 555 560 

3C Asp Gly Thr Glu Cys Glu Pro Gly Lys H.:s Cys Lys Tyr Gly Fhe Cys 
565 570 575 

Val Pro Lys Glu Met Asp Val Pro Val Thr Asp Gly Ser Trp Gly Ser 
58 0 5 95 S9C 

35 

Trp Ser Pro Fhe Gly Thr Cys Ser Arg Thr Cys Gly Gly Gly He Lys 
595 6CC 605 

Thr Ala He Arg Glu Cys Asn Arg Pro Glu Pro Lys Asn Gly Gly Lys 
4 0 £10 S15 62 0 

Tyr Cys Val Gly Arg Arg V.et Lys Fhe Lys Ser Cys Asn Thr Glu Fro 
525 630 €35 64C 

4 5 Cys Leu Lys Gin Lys Arg Asp Phe Arg Asr Glu Gin Cys Ala His Fhe 

£ 4 : 6 5 J Hi 

Af?p .7 1 y Lys His Ph* Asn 1 1 e Asn Hy Le j Le.. Ho VH Ar g Hp 

->:■; he- •; ~ : 

s :■ 

Val Frc Lys Tyr 5er Gly He Leu "et Lys Asr Arg Cys Lys Lc_ Fhe 

675 6 6 C 595 

Cys Arg Val Ala Gly Asn Thr Ala Tyr Tyr Gin Leu Arg Asp Arg Val 
55 653 £55 ' LC 



He Asp Jl 



Aen Asp He Cys Val Glr. 
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-E= 7SC 765 

Val Arg He Pre Ala Gly Ala Thr Asn lie Asp Val Arg Gin His Ser 

77c "73 "50 

5 

Phe Ser 3 1 >■ Glu Thr Asp Asp Asp Asn Tyr Leu Ala Leu Ser Ser Ser 
765 790 79S 500 

Lys Gly Glu Phe Leu Leu Asn Gly Asn Phe Val Val Thr Met Ala Lys 
10 ' 6C5 810 ei5 

Arg Glu lie Arg He Gly Asr. Ala Val Val Glu Tyr Ser Gly Ser Glu 
923 S25 330 

15 Thr Ala Val Glu Arg He Asn Ser Thr Asp Arg lie Glu Gin Glu Leu 
835 840 845 

Leu Leu Gin Val Leu Ser Val Gly Lys Leu Tyr Asn Fro Asp Val Arc 
550 855 S60 

20 

Tyr Ser Phe Asn :le Pro He Glu Asp Lys Pro Gin Gin Phe Tyr Trp 
865 67C 375 880 

Asn Ser His Gly Pro Trp Gin Ala Cys Ser Lys Fro Cys Gin Gly Glu 
25 8S5 850 395 

Arg Lys Arg Lys Leu Val Cys Thr Arg Glu Ser Asp Gin Leu Thr Val 
900 305 * 510 

3C Ser Asp Gin Arg Cys Aep Arg Leu Pre Glr. Pro Gly His He Thr Glu 
915 520 925 

Fro Cys Gly Thr Gly Cys Asp Leu Arg Trp His Val Ala Ser Arg Ser 
92C 935 340 

35 

Glu Cys Ser Ala Gin Cys Gly Leu Gly Tyr Arg Thr Leu Asp He Tyr 
945 95C 955 S6C 

Cys Ala Lys Tyr Ser Arg Leu Asp Gly Lys Tnr Glu Lys Val Asp Asp 
40 965 970 S75 

Gly Phe Cys Ser Ser His Pro Lys Pre Ser Asn Arg Glu Lys Cys Ser 
930 5 85 590 

•i 5 Gly Glu Cys A sr. Tnr Gly Gly Trc Arc Tyr S«r Ala Trp Thr Glu Cys 

9 "r 5 ' : : : o : c c s 

rer Lys Ser 7y- A = ;: Gly Gly Tr.r Glr. Arc Arc A:g Al h : 1 «- 7y* Val 

: : : c : j 1 5 l a : 

5 0 

Asn Thr Arg Asn Asp '.'al Leu Asp Asp Ser Lys Cys Thr His Gin Glu 

;c2 5 " :o:-c " ::^5 ic-ic 

Lys Val Thr He Gin Arc Cys Ser Glu Phe Pro Cys Pre Gin Trp Lys 

5 5 H4 5 1j:,C H5 5 

Ser Gly Asp Trp Ser Glu Cys Leu Val Thr Cys Gly Lys Gly His Lys 
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Thr Cys Gly Glr. Gly Tyr Gin Leu Arc Ala Val Lys Cys lie lie Gly 
112 5 113 0 Hi 5 

5 

Thr Tyr Met Ser Val Val Asp Asp Asr. Asp Cys Asn Ala Ala Thr Arg 
1141 1145 115C 

Pro Thr Asp Thr Gin Asp Cys Glu Leu Fro Ser Cys His Fro Pro Pro 
10 1155 1160 1165 

Ala Ala Frc Glu Thr Arg Arg Ser Thr Tyr Ser Ala Pro Arg Thr Gin 
117 0 11*75 1150 

15 Trp Arg Fhe Gly Ser Trp Thr Pre Cys Ser Ala Thr Cys Gly Lys Gly 
1185 1190 1195 125 0 

Thr Arg Met Arg Tyr Val Ser Cys Arg Asp Glu Asn Gly Ser Val Ala 
1205 HIO 1115 

20 

Asp Glu Ser Ala Cys Ala Tr.r Leu Fro Arg Fro Val Ala Lys Glu Glu 
1220 1225 1230 

Cys Ser Val Thr Fro Cys Gly Gin Trp Lys Ala Leu Asp Trp Ser Ser 
25 ' 1235 124C 1245 

Cys Ser Val Thr Cys Gly Gin Gly Arg Ala Thr Arg 31r. Val net lys 
1250 1255 126C 

3 0 Val Asn Tyr Ser Asp His Val He Asp Arg Ser Glu lys Asp Gin \s? 

1265 1270 12^5 1250 

Tyr He Pro Glu Thr Asp Gin Asp Cys Ser Met Ser Fro Cys Pre 31 n 
1285 1290 1195 

35 

Arg Thr Fro Asp Ser Gly Leu Ala Gin His Frc Phe Gin Asn Glu Asp 
13 0 0 13 0 5 1310 

Tyr Arc Fro Arc Ser Ala Ser Fro Ser Arg Thr Kis Val Leu Gly Gly 
40 1315 ^ 1320 1325 

Asn Gin Trp Arg Thr Gly Frc Trp Gly Ala Cys Ser Ser Thr Cys Ala 
HOC 1335 134 0 

4 5 Gly Ser Gin Arg Arc Val Val Val Cys C-ln Asp 31u Asn Gly 7yr 

H45 135: 135 5 HH 

Tr.r Al -i Anr. Aft Cys '. •. . Glu Aru I 1 e Lys V r Asp ?lu Gin A: g -.1 

1 3 G E * I • . H H 

• ■ ', 

Cys Glu Ser Gly Frc Cys Frc Gin Trp Ala Tyr Gly As:. Trp Gly Glu 

12 EC ' 13 65 13 50 

Cys Thr lys Leu Cys C-ly Gly Gly He Arg Thr Arg Leu Val Val Cys 

5 5 13 95 14 H 14 0 5 

Gin Arg Ser Asn Gly Glu Arg Fr.e ?r~ Asr Leu Ser Cys Glu : 1«: Ll-u 
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146 0 14 6 5 14 7 0 

Ser His Leu Glu Ser Asp Tyr Cys Lys His Leu Ala Lys ?ro His Gly 
1475 1460 1455 

5 

His Arc Lys Cys Arg Gly Gly Arg Cys Pro lys Trp lys Ala Gly Ala 
:45C 1495 15CC 

Trp Ser Gin Cys Ser Val Ser Cys Gly Arc Gly Val Gin C-ln Arg His 
10 1505 1510 1515 1520 

Val Gly Cys Gin He Gly Thr His Lys He Ala Arg Asp Thr Glu Cys 
1525 1550 * 1535 

15 Asn Pro Tyr Thr Arg Pro Glu Ser 31u Cys Glu Cys Gin Gly Pro Arg 
1540 1545 15EC 

Cys Pro Leu Tyr Thr Trp Arg Ala 31u Glu Ser Gin Glu Cys Thr Lys 
1555 1560 1565 

20 

Thr Cys Gly Glu Gly Ser Arg Tyr Arg Lys Val Val Cys Val Asp Asp 
157C 1575 1580 

Asn Lys Asn Glu Val His Gly Ala Arg Cys Asp Val Ser Lys Arg Fro 
251535 1590 15 95 16 00 

Val Asp Arg Glu Ser Cys Ser Leu Gin Pro Cys Glu Tyr Val Trp He 
1605 1610 1515 

3 0 Thr Gly Glu Trp Ser Glu Cys Ser Val Thr Cys Gly Lys Gly Tyr Lys 
1620 1625 163C 

Gin Arg Leu Val Ser Cys Ser Glu He Tyr Thr Gly Lys Glu Asn Tyr 
1635 1540 1645 

35 

Glu Tyr Ser Tyr Gin Thr Thr He Asn Cys Pro Gly Thr Gin Pro Fro 
165C 1655 1650 

Ser Val His Pro Cys Tyr Leu Arg Glu Cys Fro Val Ser Ala Tr.r Trp 
40 1665 1670 1675 1650 

Arg Val Giy Asn Trp Gly Ser Cys Ser Val Ser Cys Gly Val GLy Val 
1695 169C ' 15**5 

45 V~-i Gin Arg Ser Val Gin lys Lou Thr A sr. Hu Asp Gin Pro His 
HH 1H5 in: 

L^u 7y = H 1 f Tr.r Asp Leu Lys Ho Hu Hu Arg Lye, Tr.r lys; Arg Am 
H 1 5 HIT 17 2 5 

Val Tyr Asn Cys Glu Leu Pro Gin Asn Cys Lys Glu Val Lys A::g Leu 
17 3 C 1735 H 4 0 

Lys Gly Ala Ser Glu Asp Gly Glu Tyr Phe Leu Vet He Arg Gly Lys 
5 5 174 5 H5: :-5 5 I "6 0 

Leu Leu Lys He Pre Cys Ala Gly Ve: His Ser Asp His Fro Ivs Glu 
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Ser Fhe 31 r. Lys lie Arg He Asp Leu Thr Ser Met Gin lie He Thr 
1825 18 5 C 1 c 3 5 1 £ -i 0 

5 

Thr Asp Leu Gin Fhe Ala Arg Thr Ser Glu Gly His Fro Val Fro Fhe 
1643 1650 13S5 

Ala Thr Ala Gly Asp Cys Tyr Ser Ala Ala Lys Cys Fro Gin Gly Arg 
10 1&60 166S 1570 

Fhe Ser lie Asn Leu Tyr Gly Thr Gly Leu Ser Leu Thr Glu Ser Ala 
1375 13SC 1885 

15 Arg Trp He Ser Gin Gly Asr. Tyr Ala Val Ser Asp He Lys Lys Ser 
ieSD 18 95 1900 

Fro Asp Gly Thr Arg Val Val Gly Lys Cys Gly Gly Tyr Cys Gly Lys 
1905 1310 1915 192 0 

Cys Thr Pre Ser Ser Gly Thr Gly Leu Glu Val Arg Val Leu 
1525 1530 



